pHapCompass: Probabilistic Assembly and Uncertainty Quantification of Polyploid Haplotype Phase
Marjan Hosseini (1), Ella Veiner (1), Thomas Bergendahl (1), Tala Yasenpoor (1), Zane Smith (2), Margaret Staton (2), Derek Aguiar (1, 3) ((1) School of Computing, University of Connecticut, (2) Department of Entomology, Plant Pathology, University of Tennessee

TL;DR
pHapCompass introduces a probabilistic approach for assembling haplotypes in polyploid genomes, explicitly modeling read ambiguity and providing uncertainty quantification, which improves accuracy and realism over synthetic benchmarks.
Contribution
It develops a novel probabilistic algorithm for polyploid haplotype assembly that explicitly models read assignment ambiguity and quantifies uncertainty, addressing limitations of prior methods.
Findings
pHapCompass achieves competitive accuracy across various polyploid complexities.
The method provides reliable uncertainty quantification for haplotype phases.
Benchmarking shows improved performance over existing assemblers.
Abstract
Computing haplotypes from sequencing data, i.e. haplotype assembly, is an important component of molecular and population genetics problems, including interpreting the effects of genetic variation on complex traits and reconstructing genealogical relationships. Assembling the haplotypes of polyploid genomes remains a significant challenge due to the exponential search space of haplotype phasings and read assignment ambiguity; the latter challenge is particularly difficult for haplotype assemblers since the information contained within the observed sequence reads is often insufficient for unambiguous haplotype assignment in polyploid genomes. We present pHapCompass, probabilistic haplotype assembly algorithms for diploid and polyploid genomes that explicitly model and propagate read assignment ambiguity to compute a distribution over polyploid haplotype phasings. We develop graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic Associations and Epidemiology · Genomics and Rare Diseases
