# Fruit-In-Sight: A deep learning-based framework for secondary metabolite class prediction using fruit and leaf images

**Authors:** Neeraja M. Krishnan, Saroj Kumar, Binay Panda, Eugenio Llorens, Eugenio Llorens, Eugenio Llorens, Eugenio Llorens

PMC · DOI: 10.1371/journal.pone.0308708 · PLOS ONE · 2024-08-08

## TL;DR

A deep learning model called Fruit-In-Sight uses fruit and leaf images to predict the concentration class of valuable metabolites in neem trees, avoiding costly lab tests.

## Contribution

A novel deep learning framework that predicts secondary metabolite concentration classes using only fruit and leaf images, validated with a mobile app.

## Key findings

- The best deep learning model achieved a test F1 score of 0.88 for predicting metabolite concentration classes.
- A multi-analyte framework improved sensitivity and achieved 100% specificity for both low and high metabolite classes.
- The framework was implemented in a mobile app to guide fruit collection based on predicted metabolite concentration.

## Abstract

Fruits produce a wide variety of secondary metabolites of great economic value. Analytical measurement of the metabolites is tedious, time-consuming, and expensive. Additionally, metabolite concentrations vary greatly from tree to tree, making it difficult to choose trees for fruit collection. The current study tested whether deep learning-based models can be developed using fruit and leaf images alone to predict a metabolite’s concentration class (high or low). We collected fruits and leaves (n = 1045) from neem trees grown in the wild across 0.6 million sq km, imaged them, and measured concentration of five metabolites (azadirachtin, deacetyl-salannin, salannin, nimbin and nimbolide) using high-performance liquid chromatography. We used the data to train deep learning models for metabolite class prediction. The best model out of the seven tested (YOLOv5, GoogLeNet, InceptionNet, EfficientNet_B0, Resnext_50, Resnet18, and SqueezeNet) provided a validation F1 score of 0.93 and a test F1 score of 0.88. The sensitivity and specificity of the fruit model alone in the test set were 83.52 ± 6.19 and 82.35 ± 5.96, and 79.40 ± 8.50 and 85.64 ± 6.21, for the low and the high classes, respectively. The sensitivity was further boosted to 92.67± 5.25 for the low class and 88.11 ± 9.17 for the high class, and the specificity to 100% for both classes, using a multi-analyte framework. We incorporated the multi-analyte model in an Android mobile App Fruit-In-Sight that uses fruit and leaf images to decide whether to ‘pick’ or ‘not pick’ the fruits from a specific tree based on the metabolite concentration class. Our study provides evidence that images of fruits and leaves alone can predict the concentration class of a secondary metabolite without using expensive laboratory equipment and cumbersome analytical procedures, thus simplifying the process of choosing the right tree for fruit collection.

## Linked entities

- **Chemicals:** azadirachtin (PubChem CID 5281303), deacetyl-salannin (PubChem CID 14458886), salannin (PubChem CID 6437066), nimbin (PubChem CID 108058), nimbolide (PubChem CID 12313376)
- **Species:** Azadirachta indica (taxon 124943)

## Full-text entities

- **Chemicals:** salannin (MESH:C105836), nimbin (MESH:C008621), deacetyl-salannin (-), azadirachtin (MESH:C010329), nimbolide (MESH:C042198)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11309380/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11309380/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC11309380/full.md

---
Source: https://tomesphere.com/paper/PMC11309380