Improving Genomic Prediction using High-dimensional Secondary Phenotypes: the Genetic Latent Factor Approach
Killian A. C. Melsen, Jonathan F. Kunst, Jos\'e Crossa, Margaret R. Krause, Fred A. van Eeuwijk, Willem Kruijer, Carel F. W. Peeters

TL;DR
This paper introduces glfBLUP, a new method that uses latent factors derived from high-dimensional secondary phenotypes to improve genomic prediction accuracy in plant breeding.
Contribution
The paper presents a novel genetic latent factor approach that reduces dimensionality and enhances prediction accuracy while maintaining interpretability.
Findings
glfBLUP outperforms existing methods in simulations and real data
produces biologically meaningful parameters
offers a flexible framework for multi-trait prediction
Abstract
Decreasing costs and new technologies have led to an increase in the amount of data available to plant breeding programs. High-throughput phenotyping (HTP) platforms routinely generate high-dimensional datasets of secondary features that may be used to improve genomic prediction accuracy. However, integration of these data comes with challenges such as multicollinearity, parameter estimation in settings, and the computational complexity of many standard approaches. Several methods have emerged to analyze such data, but interpretation of model parameters often remains challenging. We propose genetic latent factor best linear unbiased prediction (glfBLUP), a prediction pipeline that reduces the dimensionality of the original secondary HTP data using generative factor analysis. In short, glfBLUP uses redundancy filtered and regularized genetic and residual correlation matrices to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification
