A nonparametric HMM for genetic imputation and coalescent inference
Lloyd T. Elliott, Yee Whye Teh

TL;DR
This paper introduces a nonparametric hidden Markov model based on the hierarchical Dirichlet process for genetic imputation and coalescent inference, offering a more flexible and parsimonious approach than existing models.
Contribution
It develops a truncation-free Bayesian nonparametric HMM with a new auxiliary sampling scheme, improving modeling of genetic data with nonhomogeneity and self transitions.
Findings
Outperforms the finite model fastPHASE in experiments.
Number of states correlates with the time to the most recent common ancestor.
Demonstrates flexibility of Bayesian nonparametrics for complex genetic data.
Abstract
Genetic sequence data are well described by hidden Markov models (HMMs) in which latent states correspond to clusters of similar mutation patterns. Theory from statistical genetics suggests that these HMMs are nonhomogeneous (their transition probabilities vary along the chromosome) and have large support for self transitions. We develop a new nonparametric model of genetic sequence data, based on the hierarchical Dirichlet process, which supports these self transitions and nonhomogeneity. Our model provides a parameterization of the genetic process that is more parsimonious than other more general nonparametric models which have previously been applied to population genetics. We provide truncation-free MCMC inference for our model using a new auxiliary sampling scheme for Bayesian nonparametric HMMs. In a series of experiments on male X chromosome data from the Thousand Genomes Project…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Genetic Associations and Epidemiology · Gene expression and cancer classification
