# TomatoPGT: A 3D point cloud dataset of tomato plants for segmentation and plant-trait extraction

**Authors:** Prasad Nethala, Dugan Um, Samantha L. McCoy, Seth Gibson, Mahendra Bhandari, Kiju Lee

PMC · DOI: 10.1016/j.dib.2026.112642 · Data in Brief · 2026-03-06

## TL;DR

TomatoPGT is a 3D dataset of tomato plants that helps researchers study plant structure and traits without damaging the plants.

## Contribution

TomatoPGT introduces a 3D point cloud dataset with detailed annotations and graph-based representations for plant phenotyping.

## Key findings

- The dataset includes 42 scans of tomato plants with RGB images and dense colored point clouds.
- Manual annotations and graph-based traits like internode length and angles are provided for research use.

## Abstract

Three-dimensional (3D) point-cloud phenotyping enables non-destructive and repeatable characterization of plant architecture, supporting the measurement of traits such as internode length, branching topology, and organ orientation. This article presents TomatoPGT (Tomato Plant Graph Twin), a 3D tomato dataset designed for research on semantic/instance segmentation, graph-based structural representation, and graph-derived phenotypic trait extraction.

The dataset contains 42 scans from three greenhouse-grown tomato plants acquired across early to mid-vegetative development using a rotational multi-view imaging system. Each scan consists of 60–70 overlapping RGB images captured under uniform illumination and reconstructed into a metrically scaled dense colored point cloud using Structure-from-Motion and multi-view stereo. TomatoPGT provides: (i) multi-view RGB images, (ii) dense colored point clouds, (iii) manually curated semantic and instance annotations at organ level, (iv) graph representations encoding plant topology and geometry, and (v) tabulated phenotypic traits computed deterministically from the graphs (internode length, insertion angles, and phyllotactic angles). TomatoPGT supports reproducible development and evaluation of 3D phenotyping pipelines, including learning-based segmentation and graph-based modeling of plant architecture.

## Full-text entities

- **Species:** Solanum lycopersicum (tomato, species) [taxon 4081]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13000484/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13000484/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC13000484/full.md

---
Source: https://tomesphere.com/paper/PMC13000484