# Updating high-resolution image dataset for the automatic classification of phenological stage and identification of racemes in Urochloa spp. hybrids with expanded images and annotations

**Authors:** Darwin Alexis Arrechea-Castillo, Paula Espitia-Buitrago, David Florian-Vargas, Ronald David Arboleda, Riquelmer Velázquez-Hernández, Andrés Felipe Ruiz-Hurtado, Luis Miguel Hernandez, Rosa N. Jauregui, Juan Andrés Cardoso

PMC · DOI: 10.1016/j.dib.2025.111593 · Data in Brief · 2025-04-28

## TL;DR

This paper presents an expanded dataset of high-resolution images for classifying phenological stages and identifying racemes in Urochloa hybrids, improving deep learning model training.

## Contribution

The dataset has been expanded with more images, annotations, and diverse capture conditions to enhance deep learning model robustness.

## Key findings

- The dataset now includes 2539 images and 47,323 raceme annotations, with increased diversity in capture conditions.
- Raceme density per plant has nearly doubled in some samples, enabling advanced instance segmentation tasks.
- Images were captured using various devices and perspectives, improving model adaptability for real-world applications.

## Abstract

This dataset is an expanded version of a previously published collection of high-resolution RGB images of Urochloa spp. genotypes, initially designed to facilitate automated classification of phenological stages and raceme identification in forage breeding trials. The original dataset included 2400 images of 200 genotypes captured under controlled conditions, supporting the development of computer vision models for High-Throughput Phenotyping (HTP). In this updated release, 139 additional images and 24,983 new annotations have been added, bringing the dataset to a total of 2539 images and 47,323 raceme annotations. This version introduces increased diversity in image-capture conditions, with data collected from two geographic locations (Palmira, Colombia, and Ocozocoautla de Espinosa, Mexico) and a range of image-capture devices, including smartphones (e.g. Realme C53 and Oppo Reno 11), a Nikon D5600 camera, and a Phantom 4 Pro V2 drone. Images now vary in perspective (nadir, high-angle, and frontal) and capture distance (1–3 meters), enhancing the dataset applicability for robust Deep Learning (DL) models. Compared to the original dataset, raceme density per plant has nearly doubled in some samples, offering higher raceme overlap for advanced instance segmentation tasks. This expanded dataset supports deeper exploration of phenotypic variation in Urochloa spp. and offers greater potential for developing adaptable models in crop phenotyping.

## Full-text entities

- **Chemicals:** raceme (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12136700/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12136700/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/PMC12136700/full.md

---
Source: https://tomesphere.com/paper/PMC12136700