Bayesian Inference for Tumor Subclones Accounting for Sequencing and Structural Variants
Juhee Lee, Peter Mueller, Subhajit Sengupta, Kamalakar Gulukota, Yuan, Ji

TL;DR
This paper introduces a Bayesian model that jointly infers tumor subclonal copy numbers and sequence variants from sequencing data, improving understanding of tumor heterogeneity.
Contribution
It presents a novel Bayesian feature allocation model that simultaneously estimates subclonal structures and variants, accounting for sequencing and structural variants.
Findings
Enhanced accuracy in subclonal structure inference.
Effective joint modeling of copy number and sequence variants.
Validated with simulations and real data.
Abstract
Tumor samples are heterogeneous. They consist of different subclones that are characterized by differences in DNA nucleotide sequences and copy numbers on multiple loci. Heterogeneity can be measured through the identification of the subclonal copy number and sequence at a selected set of loci. Understanding that the accurate identification of variant allele fractions greatly depends on a precise determination of copy numbers, we develop a Bayesian feature allocation model for jointly calling subclonal copy numbers and the corresponding allele sequences for the same loci. The proposed method utilizes three random matrices, L, Z and w to represent subclonal copy numbers (L), numbers of subclonal variant alleles (Z) and cellular fractions of subclones in samples (w), respectively. The unknown number of subclones implies a random number of columns for these matrices. We use next-generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · Genomics and Phylogenetic Studies · Genomics and Rare Diseases
