The next frontier for genomic prediction is computational
Lauren M McIntyre

TL;DR
This paper discusses the growing importance of computational methods in genomic prediction and invites new research and debate on the topic.
Contribution
The paper highlights the need for new computational approaches to improve genomic prediction.
Findings
Genomic prediction is becoming more dynamic with new technologies.
Computational models face challenges in integrating different data types.
The paper invites submissions to advance the field through discussion and research.
Abstract
From the seminal paper by Meeuwissen et al. 2001 to the GSA Journals Series launch in 2015, the field of Genomic Prediction continues to gain momentum. The field is increasingly dynamic, with new technology increasing the scale and scope of the data available. Significant challenges exist in building computational models. Questions of how to appropriately account for different types of data, and which data improve predictions are interwoven. What is the best path forward? What methods will improve predictions? Authors can submit their rejoinder or start a new discussion on one of the many important topics in the field by submitting a Dialogue and Debate article for peer review at G3.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Genomics and Phylogenetic Studies · Genomics and Rare Diseases
From the seminal paper by Meuwissen et al. (2001) to the GSA Journals Series launch in 2015, the field of Genomic Prediction continues to gain momentum. The field is increasingly dynamic, with new technology increasing the scale and scope of the data available. Significant challenges exist in building computational models. Questions of how to appropriately account for different types of data and which data improve predictions are interwoven. Recent papers at the GSA journals have endeavored to address relatedness and population structure (Pocrnic et al. 2024), historical data (Costa-Neto et al. 2023; Crossa et al. 2025; Vitale et al. 2025), phenotypic data from satellite images (Morales et al. 2024 ), hyperspectral imaging (Concepcion et al. 2025), physiological models, environmental interactions and variation in management practice, genome interactions (Yang et al. 2023 ), genome and environment interactions (Xavier et al. 2024), missing data, copy number/ploidy (Osorio-Guarin et al. 2024; Wilson et al. 2024; Endelman 2025; Tessele et al. 2025), low-input breeding potential (Olsson et al. 2025 ), and the genetic architecture of the trait of interest (Gibbs et al. 2025) into increasingly complex models.
What is the best path forward? What methods will improve predictions? Deep learning (Montesinos-López et al. 2023, 2024a, 2024b; Kihlman et al. 2024)? Regularization (Montesinos-López et al. 2024a, 2024b)? Ensemble models (Tomura et al. 2025)? Or something simpler (Ahlinder et al. 2024)? Fundamentally, we are all engaged with trying to understand how we can best predict beyond the environmental data at hand (Hu et al. 2025) and how important it is to have similar environments in the training data (Rogers and Holland 2022).
Howard and Lipka (2025) argue that we should consider simpler models in “Genomic selection and reproducibility: are complex models distracting us from true scientific validity in the presence of genotype-by-environment interaction,” the latest Dialogue and Debate published in G3. Do you share this perspective? You can submit your rejoinder or start a new discussion on one of the many important topics in the field by submitting your Dialogue and Debate article for peer review at G3.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ahlinder J, Hall D, Suontama M, Sillanpää MJ. 2024. Principal component analysis revisited: fast multitrait genetic evaluations with smooth convergence. G 3 (Bethesda). 14:jkae 228. 10.1093/g 3journal/jkae 228.39429114 PMC 11631533 · doi ↗ · pubmed ↗
- 2Concepcion JS, Noble AD, Thompson AM, Dong Y, Olson EL. 2025. Genomic and hyperspectral imaging-based prediction blending enables selection for reduced deoxynivalenol content in wheat grains. G 3 (Bethesda). 15:jkaf 176. 10.1093/g 3journal/jkaf 176.40795041 PMC 12506668 · doi ↗ · pubmed ↗
- 3Costa-Neto G et al 2023. Envirome-wide associations enhance multi-year genome-based prediction of historical wheat breeding data. G 3 (Bethesda). 13:jkac 313. 10.1093/g 3journal/jkac 313.36454213 PMC 9911085 · doi ↗ · pubmed ↗
- 4Crossa J et al 2025. Evaluating the effectiveness of selection indices and their genomic prediction using environmental and historical rice data. G 3 (Bethesda). 15:jkaf 087. 10.1093/g 3journal/jkaf 087.40238949 PMC 12135014 · doi ↗ · pubmed ↗
- 5Endelman JB . 2025. Genomic prediction of heterosis, inbreeding control, and mate allocation in outbred diploid and tetraploid populations. Genetics. 229:iyae 193. 10.1093/genetics/iyae 193.39552210 · doi ↗ · pubmed ↗
- 6Gibbs PM, Paril JF, Fournier-Level A. 2025. Trait genetic architecture and population structure determine model selection for genomic prediction in natural Arabidopsis thaliana populations. Genetics. 229:iyaf 003. 10.1093/genetics/iyaf 003.39814947 · doi ↗ · pubmed ↗
- 7Howard R, Lipka AE. 2025. Genomic selection and reproducibility: are complex models distracting us from true scientific validity in the presence of genotype-by-environment interaction?. G 3 (Bethesda). 15:jkaf 244. 10.1093/g 3journal/jkaf 244.41182845 PMC 12693495 · doi ↗ · pubmed ↗
- 8Hu H, Rincent R, Runcie DE. 2025. Mega LMM improves genomic predictions in new environments using environmental covariates. Genetics. 229:iyae 171. 10.1093/genetics/iyae 171.39471330 PMC 11708919 · doi ↗ · pubmed ↗
