# The digital orchard: advanced data-driven technologies in apple breeding and genetic modification

**Authors:** Fazeel Abid, Zhao Zhang, Ghulam Farooque, Rana Muhammad Zulqarnain, Jawad Rasheed, Onur Osman, Shtwai Alsubai, Leila Jamel

PMC · DOI: 10.3389/fpls.2025.1725617 · Frontiers in Plant Science · 2026-01-12

## TL;DR

This paper reviews how advanced data-driven technologies are transforming apple breeding by speeding up trait selection and genetic modification.

## Contribution

The paper systematically synthesizes the convergence of high-throughput phenotyping, machine learning, and CRISPR in apple breeding.

## Key findings

- High-throughput phenotyping using sensors like LiDAR and hyperspectral imaging automates trait data collection at scale.
- Machine learning models achieve over 96% accuracy in cultivar identification and improve genomic selection by 18%.
- CRISPR-based genome editing enables rapid introduction of traits like disease resistance and improved shelf life.

## Abstract

The apple (Malus × domestica), a globally significant perennial fruit crop, faces immense pressure from climate change, evolving pathogens, and consumer demand for novel traits. Also, remains constrained by slow trait selection despite technological advances. Further, the traditional breeding methods are slow and resource-intensive, hampered by the apple’s long juvenile period and high heterozygosity. This systematic literature review (SLR) synthesizes the state of the art in advanced data-driven technologies for accelerating apple breeding and genetic modification. Following the PRISMA-EcoEvo protocol, 47 selected studies were analyzed from databases including Web of Science, Scopus, and PubMed. Our thematic synthesis reveals a paradigm shift towards a “digital breeding” model, characterized by the convergence of three core technological pillars. First, high-throughput phenotyping (HTP), which leverages sensor modalities such as RGB-D, hyperspectral imaging, and LiDAR, is automating the collection of trait data at an unprecedented scale. Second, machine learning (ML) and deep learning (DL) algorithms are being deployed for diverse applications, including cultivar identification with over 96% accuracy, non-destructive quality prediction, and genomic selection, thereby boosting predictive ability for key traits by up to 18%. Third, precise and efficient genome editing, predominantly using Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated protein 9 (Cas9), is enabling the rapid introduction of desirable traits, such as disease resistance, enhanced shelf life, and improved nutrient uptake. Demonstrated transgene-free editing protocols are accelerating the path to commercialization. We further explore the integration of these pillars through the agricultural internet of things (AIoT) and discuss emerging frontiers, including federated learning for data privacy, explainable AI (XAI) for model transparency, and the implications of recent regulatory frameworks. This review identifies critical research gaps, including the need for standardized open-access datasets and integrated end-to-end system validation. It concludes that the synergistic application of these technologies is poised to revolutionize the speed, precision, and resilience of apple improvement programs worldwide.

## Linked entities

- **Species:** Malus domestica (taxon 3750)

## Full-text entities

- **Species:** Malus domestica (apple, species) [taxon 3750]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12833213/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12833213/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC12833213/full.md

---
Source: https://tomesphere.com/paper/PMC12833213