rfPhen2Gen: A machine learning based association study of brain imaging phenotypes to genotypes
Muhammad Ammar Malik, Alexander S. Lundervold, Tom Michoel

TL;DR
This study uses machine learning, specifically random forest regression, to predict genotypes from brain imaging traits, revealing novel associations and providing a non-linear alternative to traditional GWAS methods.
Contribution
It introduces the application of random forest regression for genotype prediction from imaging phenotypes, highlighting its advantages over linear models in uncovering new associations.
Findings
Random forest regression outperformed linear models in SNP prediction.
Identified SNPs associated with brain regions linked to Alzheimer's disease.
Non-linear models reveal additional genotype-phenotype associations.
Abstract
Imaging genetic studies aim to find associations between genetic variants and imaging quantitative traits. Traditional genome-wide association studies (GWAS) are based on univariate statistical tests, but when multiple traits are analyzed together they suffer from a multiple-testing problem and from not taking into account correlations among the traits. An alternative approach to multi-trait GWAS is to reverse the functional relation between genotypes and traits, by fitting a multivariate regression model to predict genotypes from multiple traits simultaneously. However, current reverse genotype prediction approaches are mostly based on linear models. Here, we evaluated random forest regression (RFR) as a method to predict SNPs from imaging QTs and identify biologically relevant associations. We learned machine learning models to predict 518,484 SNPs using 56 brain imaging QTs. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Bioinformatics and Genomic Networks · Folate and B Vitamins Research
MethodsFeature Selection
