Automated deconvolution of structured mixtures from bulk tumor genomic data
Theodore Roman, Lu Xie, Russell Schwartz

TL;DR
This paper presents advanced computational methods to improve the deconvolution of bulk tumor genomic data, especially CNV data, enabling more accurate identification of cellular subpopulations and mixture structures.
Contribution
The authors introduce automated learning techniques for genomic mixture substructure, including dimensionality estimation, fuzzy clustering, and model inference, enhancing prior deconvolution methods.
Findings
Improved accuracy in simulated mixture scenarios.
Effective identification of tumor substructure in real CNV data.
Enhanced methods outperform previous approaches in noisy, sparse data.
Abstract
Motivation: As cancer researchers have come to appreciate the importance of intratumor heterogeneity, much attention has focused on the challenges of accurately profiling heterogeneity in individual patients. Experimental technologies for directly profiling genomes of single cells are rapidly improving, but they are still impractical for large-scale sampling. Bulk genomic assays remain the standard for population-scale studies, but conflate the influences of mixtures of genetically distinct tumor, stromal, and infiltrating immune cells. Many computational approaches have been developed to deconvolute these mixed samples and reconstruct the genomics of genetically homogeneous clonal subpopulations. All such methods, however, are limited to reconstructing only coarse approximations to a few major subpopulations. In prior work, we showed that one can improve deconvolution of genomic data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
