Bayesian Sparse Factor Analysis of Genetic Covariance Matrices
Daniel E Runcie, Sayan Mukherjee

TL;DR
This paper introduces a Bayesian sparse factor model for estimating high-dimensional genetic covariance matrices, improving interpretability and robustness in gene expression studies by assuming modular, sparse underlying factors.
Contribution
The authors develop a novel Bayesian sparse factor model tailored for high-dimensional genetic covariance estimation, incorporating biological plausibility and sparsity constraints.
Findings
Model effectively captures genetic covariance structure in simulated data.
Application to Drosophila gene expression data reveals biologically meaningful factors.
Sparsity enhances interpretability and reduces sampling errors in high-dimensional settings.
Abstract
Quantitative genetic studies that model complex, multivariate phenotypes are important for both evolutionary prediction and artificial selection. For example, changes in gene expression can provide insight into developmental and physiological mechanisms that link genotype and phenotype. However, classical analytical techniques are poorly suited to quantitative genetic studies of gene expression where the number of traits assayed per individual can reach many thousand. Here, we derive a Bayesian genetic sparse factor model for estimating the genetic covariance matrix (G-matrix) of high-dimensional traits, such as gene expression, in a mixed effects model. The key idea of our model is that we need only consider G-matrices that are biologically plausible. An organism's entire phenotype is the result of processes that are modular and have limited complexity. This implies that the G-matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
