# Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping

**Authors:** Yujie Zhang, Sabine Struckmeyer, Andreas Kolb, Sven Reichardt

PMC · DOI: 10.1038/s41597-026-06926-9 · Scientific Data · 2026-02-28

## TL;DR

TomatoMAP is a new dataset for tomato plant analysis that reduces observer bias and improves accuracy in phenotyping.

## Contribution

TomatoMAP introduces a large annotated dataset and benchmarks AI models for fine-grained tomato phenotyping.

## Key findings

- TomatoMAP includes 68,080 images with detailed annotations for plant regions and growth stages.
- AI models trained on TomatoMAP achieve comparable accuracy to human experts in phenotyping.
- Benchmarking shows trade-offs between model accuracy and efficiency for real-time applications.

## Abstract

Observer bias and inconsistencies in traditional plant phenotyping methods limit the accuracy and reproducibility of fine-grained plant analysis. To address these limitations, TomatoMAP is introduced as a comprehensive dataset for Solanum lycopersicum. The dataset contains 68,080 RGB images: 3,616 high-resolution macrophotographs (3648 × 5472) with semantic annotations, and 64,464 moderate-resolution images (1080 × 1440) captured from 12 plant poses at four camera elevations. Each image is accompanied by manually annotated bounding boxes for seven regions of interest (leaves, panicle, flower clusters, fruit clusters, axillary shoot, shoot, and whole-plant area) and by labels spanning 50 BBCH classes representing phenologically growth stages. A general cascading structure is proposed. For real-time applicability, models emphasizing the accuracy-efficiency trade-off (MobileNetv3, YOLOv11, and Mask R-CNN) are prioritized and benchmarked against multiple state-of-the-art models. Performance is assessed using accuracy, mAP, inference FPS, and normalized confusion matrices. In a study involving five domain experts, AI models trained on TomatoMAP achieves comparable accuracy levels. Reliability of automated fine-grained phenotyping is supported by Cohen’s Kappa statistics and inter-rater agreement heatmaps.

## Linked entities

- **Species:** Solanum lycopersicum (taxon 4081)

## Full-text entities

- **Diseases:** leaf disease (MESH:D004194), nutrient deficiencies (MESH:D007153)
- **Chemicals:** TomatoMAP (-)
- **Species:** Solanum lycopersicum (tomato, species) [taxon 4081], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** YOLOv11 — Homo sapiens (Human), Transformed cell line (CVCL_C1JD)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12954114/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12954114/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12954114/full.md

---
Source: https://tomesphere.com/paper/PMC12954114