CN-SBM: Categorical Block Modelling For Primary and Residual Copy Number Variation
Kevin Lam, William Daniels, J Maxwell Douglas, Daniel Lai, Samuel Aparicio, Benjamin Bloem-Reddy, Yongjin Park

TL;DR
CN-SBM is a probabilistic model that jointly clusters samples and genomic regions based on discrete copy number states, capturing tumor heterogeneity and aiding prognosis in cancer genomics.
Contribution
It introduces a novel bipartite categorical block model for CNV data, with a scalable inference algorithm, improving over existing methods in accuracy and interpretability.
Findings
Improved fit on simulated and real datasets.
Revealed clinically relevant subtypes in glioma.
Enhanced patient stratification for prognosis.
Abstract
Cancer is a genetic disorder whose clonal evolution can be monitored by tracking noisy genome-wide copy number variants. We introduce the Copy Number Stochastic Block Model (CN-SBM), a probabilistic framework that jointly clusters samples and genomic regions based on discrete copy number states using a bipartite categorical block model. Unlike models relying on Gaussian or Poisson assumptions, CN-SBM respects the discrete nature of CNV calls and captures subpopulation-specific patterns through block-wise structure. Using a two-stage approach, CN-SBM decomposes CNV data into primary and residual components, enabling detection of both large-scale chromosomal alterations and finer aberrations. We derive a scalable variational inference algorithm for application to large cohorts and high-resolution data. Benchmarks on simulated and real datasets show improved model fit over existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · Genomic variations and chromosomal abnormalities · Glioma Diagnosis and Treatment
