Applying traditional and machine learning-based GWAS approaches for marker-trait identification in wheat
Joel Joshua Milek, Sebastian Michel, Alexander Buchelt, Andreas Holzinger, Eva Maria Molin

TL;DR
This paper compares traditional and machine learning methods for identifying genetic markers linked to traits in wheat, showing that machine learning can detect additional markers.
Contribution
The study demonstrates how machine learning complements traditional GWAS by capturing non-linear genetic effects in wheat.
Findings
Traditional GWAS tools showed variability in runtime and marker-trait associations.
Machine learning models identified novel markers not detected by traditional methods.
ML approaches enhance detection of complex genetic signals in wheat traits.
Abstract
Complex traits arise from polygenic and interactive genomic architectures that are difficult to resolve using traditional genome-wide association study (GWAS) approaches. Machine learning (ML) provides complementary methods capable of capturing non-linear effects, improving signal detection, and enhancing predictive accuracy of marker trait associations (MTAs). Using a publicly available winter wheat dataset (CIMMYT), we evaluated several widely used traditional GWAS tools, including GAPIT, GCTA, GEMMA, sommer, and TASSEL, with respect to computational efficiency, model performance, and the consistency of detected associations. In parallel, ML approaches, such as Elastic Net, Extreme Gradient Boosting (XGBoost), Random Forest, and the hybrid TSLRF model, were assessed based on feature importance metrics and functional annotation of selected markers. Despite a shared reliance on mixed…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals · Genetic Associations and Epidemiology
