ComHapDet: A Spatial Community Detection Algorithm for Haplotype Assembly
Abishek Sankararaman, Haris Vikalo, Fran\c{c}ois Baccelli

TL;DR
ComHapDet introduces a novel community detection-based algorithm for haplotype assembly from sequencing data, effectively handling polyploid genomes and multi-allelic variants with improved accuracy and efficiency.
Contribution
The paper presents a new graphical framework and algorithm for haplotype assembly that models reads as a community detection problem, accommodating polyploidy and multi-allelic variants.
Findings
Performs well on simulated data.
Outperforms existing methods on experimental data.
Effective for polyploid and multi-allelic haplotypes.
Abstract
Background: Haplotypes, the ordered lists of single nucleotide variations that distinguish chromosomal sequences from their homologous pairs, may reveal an individual's susceptibility to hereditary and complex diseases and affect how our bodies respond to therapeutic drugs. Reconstructing haplotypes of an individual from short sequencing reads is an NP-hard problem that becomes even more challenging in the case of polyploids. While increasing lengths of sequencing reads and insert sizes {\color{black} helps improve accuracy of reconstruction}, it also exacerbates computational complexity of the haplotype assembly task. This has motivated the pursuit of algorithmic frameworks capable of accurate yet efficient assembly of haplotypes from high-throughput sequencing data. Results: We propose a novel graphical representation of sequencing reads and pose the haplotype assembly problem as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
