High-dimensional multi-trait GWAS by reverse prediction of genotypes
Muhammad Ammar Malik, Adriaan-Alexander Ludl, Tom Michoel

TL;DR
This paper explores the use of reverse regression methods, including machine learning techniques, for high-dimensional multi-trait GWAS to improve the identification of genetic associations with multiple traits.
Contribution
It systematically evaluates various machine learning methods for reverse regression in multi-trait GWAS, demonstrating their ability to predict genotypes and identify trait-associated genetic regions.
Findings
Genotype prediction performance distinguishes high and low transcriptional activity regions.
Model coefficients correlate with trait-variant associations and trans-eQTL targets.
Methods show complementary strengths in identifying genetic associations.
Abstract
Multi-trait genome-wide association studies (GWAS) use multi-variate statistical methods to identify associations between genetic variants and multiple correlated traits simultaneously, and have higher statistical power than independent univariate analyses of traits. Reverse regression, where genotypes of genetic variants are regressed on multiple traits simultaneously, has emerged as a promising approach to perform multi-trait GWAS in high-dimensional settings where the number of traits exceeds the number of samples. We analyzed different machine learning methods (ridge regression, naive Bayes/independent univariate, random forests and support vector machines) for reverse regression in multi-trait GWAS, using genotypes, gene expression data and ground-truth transcriptional regulatory networks from the DREAM5 SysGen Challenge and from a cross between two yeast strains to evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Mapping and Diversity in Plants and Animals · Gene expression and cancer classification · Genetic and phenotypic traits in livestock
