# AFruitDB: A comprehensive dataset of six commonly used Asian fruits for advanced grading and biodiversity insights

**Authors:** Mayen Uddin Mojumdar, Shahrin Islam, Md Al Mamun, Rifat Hasan, Shah Md Tanvir Siddiquee, Narayan Ranjan Chakraborty

PMC · DOI: 10.1016/j.dib.2025.111380 · Data in Brief · 2025-02-11

## TL;DR

AFruitDB is a dataset of six Asian fruits with 3,167 images collected to improve fruit grading and support biodiversity research using machine learning.

## Contribution

A novel dataset of six Asian fruits collected for advanced grading and biodiversity insights using mobile cameras.

## Key findings

- AFruitDB contains 3,167 images of six fruit types collected from local markets in Bangladesh.
- The dataset enables quality grading of fruits into good, medium, and bad categories.
- It supports biodiversity conservation and machine learning applications for grading and yield prediction.

## Abstract

The Asian subcontinent produces a vast range of fruits throughout the seasons. However, correctly classifying these fruits according to their qualities can be difficult, frequently necessitating the knowledge of fruit experts and cutting-edge equipment to produce accurate results. Therefore, to enable sophisticated grading methods that efficiently sort and evaluate fruit quality based on various characteristics (such as form, color, size, texture, and other crucial parameters), A unique dataset is deployed to support advanced grading systems. This dataset helps researchers explore genetic variation, ecological adaptation, and environmental factors that affect fruit qualities for conservation and sustainable agriculture. Using a mobile camera, these data are personally collected at various times of the day at local markets in Bangladesh that receive optimal sunlight. To create a unique dataset, 6 types of fruit consisting of 3,167 images have been collected. These six different types of fruit: apple, banana, burmese grape, mango, papaya, and tomato were used for quality grading, categorizing them as (i) good, (ii) medium, and (iii) bad. This dataset will help researchers in biodiversity conservation by building efficient machine-learning models and applying machine-learning techniques. Smart fruit grading, classification, and yield prediction automation systems can be built with this dataset.

## Full-text entities

- **Species:** Solanum lycopersicum (tomato, species) [taxon 4081], Musa acuminata (banana, species) [taxon 4641], Carica papaya (mamon, species) [taxon 3649], Malus domestica (apple, species) [taxon 3750], Mangifera indica (mango, species) [taxon 29780]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11889572/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC11889572/full.md

## References

10 references — full list in the complete paper: https://tomesphere.com/paper/PMC11889572/full.md

---
Source: https://tomesphere.com/paper/PMC11889572