Effects of using deep learning to predict the geographic origin of barley genebank accessions on genome–environment association studies
Che-Wei Chang, Karl Schmid

TL;DR
This paper explores using deep learning to predict the geographic origins of barley samples to improve genome-environment association studies for crop resilience.
Contribution
The novel use of neural networks to impute geographic origins and integrate them into genome-environment association studies is presented.
Findings
Neural networks accurately predicted geographic origins but sometimes produced ecologically implausible results.
Imputed data in GEA partially overlapped with regular GEA but also revealed new genomic regions near flowering time genes.
Increasing sample size via imputation did not significantly improve GEA performance as expected.
Abstract
Genome–environment association (GEA) is an approach for identifying adaptive loci by combining genetic variation with environmental parameters, offering potential for improving crop resilience. However, its application to genebank accessions is limited by missing geographic origin data. To address this limitation, we explored the use of neural networks to predict the geographic origins of barley accessions and integrate imputed environmental data into GEA. Neural networks demonstrated high accuracy in cross-validation but occasionally produced ecologically implausible predictions as models solely considered geographical proximity. For example, some predicted origins were located within non-arable regions, such as the Mediterranean Sea. Using barley flowering time genes as benchmarks, GEA integrating imputed environmental data (\documentclass[12pt]{minimal} \usepackage{amsmath}…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Mapping and Diversity in Plants and Animals · Wheat and Barley Genetics and Pathology · Genetic and phenotypic traits in livestock
