ZIPcnv: accurate and efficient inference of copy number variations from shallow whole-genome sequencing
Zhengfa Xue, Jingyu Zeng, Xuwen Wang, Jiajing Yuan, Tianci Wang, Xin Lai, Lin Wang, Yu Wang, Huanhuan Zhu, Xin Jin, Jiayin Wang

TL;DR
ZIPcnv is a new tool that improves the detection of copy number variations from shallow whole-genome sequencing data by addressing zero-inflation and noise issues.
Contribution
ZIPcnv introduces a novel statistical approach combining smoothing and adaptive windowing to enhance CNV detection accuracy in sWGS data.
Findings
ZIPcnv outperforms existing tools in detecting CNVs from shallow whole-genome sequencing data.
The method effectively handles zero-inflation and noise through a cumulative sum strategy and dynamic windowing.
Evaluation on simulated and real data confirms ZIPcnv's consistent performance improvements.
Abstract
Shallow whole-genome sequencing (sWGS), a rapid and cost-effective sequencing technology, has gradually been widely adopted for CNV analyses. However, with genome‑wide coverage of only 0.1–5×, sWGS data display a pronounced zero‑inflation phenomenon—a large fraction of loci has zero sequencing reads. Zero inflation causes read counts to fluctuate by several‑fold between adjacent windows. As a result, random upward blips in coverage can be misinterpreted as copy‑number gains (false positives), and true deletions often become indistinguishable from pervasive zero‑coverage noise. In addition, existing CNV detection tools developed for sWGS data often struggle to adapt across different CNV sizes. These combined effects severely constrain the accuracy of CNV inference. To address above challenges, we propose ZIPcnv, a novel CNV detection tool specifically designed for sWGS data. First, we…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · Genomic variations and chromosomal abnormalities · Genomics and Phylogenetic Studies
