CGHTRIMMER: Discretizing noisy Array CGH Data
Charalampos E. Tsourakakis, David Tolliver, Maria A. Tsiarli, Stanley, Shackney, Russell Schwartz

TL;DR
CGHTRIMMER is a fast, accurate segmentation method for noisy array CGH data that improves detection of genomic copy number variations in cancer research.
Contribution
It introduces a novel dynamic programming-based segmentation algorithm that outperforms existing methods in accuracy and speed for aCGH data analysis.
Findings
Achieves higher precision and recall than competitors.
Finds novel genomic markers supported by literature.
Runs 100 to 1000 times faster than existing methods.
Abstract
The development of cancer is largely driven by the gain or loss of subsets of the genome, promoting uncontrolled growth or disabling defenses against it. Identifying genomic regions whose DNA copy number deviates from the normal is therefore central to understanding cancer evolution. Array-based comparative genomic hybridization (aCGH) is a high-throughput technique for identifying DNA gain or loss by quantifying total amounts of DNA matching defined probes relative to healthy diploid control samples. Due to the high level of noise in microarray data, however, interpretation of aCGH output is a difficult and error-prone task. In this work, we tackle the computational task of inferring the DNA copy number per genomic position from noisy aCGH data. We propose CGHTRIMMER, a novel segmentation method that uses a fast dynamic programming algorithm to solve for a least-squares objective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomic variations and chromosomal abnormalities · Chromosomal and Genetic Variations · Genomics and Phylogenetic Studies
