Novel probabilistic models of spatial genetic ancestry with applications to stratification correction in genome-wide association studies
Anand Bhaskar, Adel Javanmard, Thomas A. Courtade, David Tse

TL;DR
This paper introduces GAP, a probabilistic model that accurately infers spatial genetic ancestry and corrects for population stratification in GWAS, outperforming traditional PCA-based methods on diverse datasets.
Contribution
The authors develop a unified probabilistic framework and inference algorithm, GAP, that improves visualization and inference of population structure and enhances GWAS accuracy.
Findings
GAP outperforms PCA in reconstructing spatial ancestry coordinates.
GAP provides more accurate correction for population stratification in GWAS.
Method shows superior power in detecting true associations in real datasets.
Abstract
Genetic variation in human populations is influenced by geographic ancestry due to spatial locality in historical mating and migration patterns. Spatial population structure in genetic datasets has been traditionally analyzed using either model-free algorithms, such as principal components analysis (PCA) and multidimensional scaling, or using explicit spatial probabilistic models of allele frequency evolution. We develop a general probabilistic model and an associated inference algorithm that unify the model-based and data-driven approaches to visualizing and inferring population structure. Our algorithm, Geographic Ancestry Positioning (GAP), relates local genetic distances between samples to their spatial distances, and can be used for visually discerning population structure as well as accurately inferring the spatial origin of individuals on a two-dimensional continuum. On both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals
