Genomic Region Detection via Spatial Convex Clustering
John Nagorski, Genevera I. Allen

TL;DR
This paper introduces SpaCC, a convex clustering method for identifying contiguous genomic regions from spatially registered probes, improving analysis of methylation and copy number variation data across multiple subjects.
Contribution
The paper presents a novel convex clustering approach tailored for genomic region detection, with scalable algorithms and automated tuning, outperforming existing methods in simulation and case studies.
Findings
SpaCC improves region detection accuracy over existing methods.
The method effectively handles missing data and tuning parameter selection.
Applications demonstrate its utility in cancer epigenetics analyses.
Abstract
Several modern genomic technologies, such as DNA-Methylation arrays, measure spatially registered probes that number in the hundreds of thousands across multiplechromosomes. The measured probes are by themselves less interesting scientifically; instead scientists seek to discover biologically interpretable genomic regions comprised of contiguous groups of probes which may act as biomarkers of disease or serve as a dimension-reducing pre-processing step for downstream analyses. In this paper, we introduce an unsupervised feature learning technique which maps technological units (probes) to biological units (genomic regions) that are common across all subjects. We use ideas from fusion penalties and convex clustering to introduce a method for Spatial Convex Clustering, or SpaCC. Our method is specifically tailored to detecting multi-subject regions of methylation, but we also test our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomic variations and chromosomal abnormalities · Epigenetics and DNA Methylation · Gene expression and cancer classification
