A Bayesian feature allocation model for tumor heterogeneity
Juhee Lee, Peter M\"uller, Kamalakar Gulukota, Yuan Ji

TL;DR
This paper introduces a Bayesian feature allocation model to analyze tumor heterogeneity by inferring latent haplotypes from sequencing data, capturing subclonal evolution and variation across tumor samples.
Contribution
It develops a Bayesian feature allocation framework based on a simplified Indian buffet process to model overlapping haplotypes in tumor heterogeneity analysis.
Findings
Successfully infers latent haplotypes from sequencing data
Captures tumor heterogeneity through varying haplotype proportions
Models subclonal evolution with overlapping haplotypes
Abstract
We develop a feature allocation model for inference on genetic tumor variation using next-generation sequencing data. Specifically, we record single nucleotide variants (SNVs) based on short reads mapped to human reference genome and characterize tumor heterogeneity by latent haplotypes defined as a scaffold of SNVs on the same homologous genome. For multiple samples from a single tumor, assuming that each sample is composed of some sample-specific proportions of these haplotypes, we then fit the observed variant allele fractions of SNVs for each sample and estimate the proportions of haplotypes. Varying proportions of haplotypes across samples is evidence of tumor heterogeneity since it implies varying composition of cell subpopulations. Taking a Bayesian perspective, we proceed with a prior probability model for all relevant unknown quantities, including, in particular, a prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
