# Enhancing interpretation of clinical disease-associated copy number variations from multiple sequencing strategies with CNVSeeker

**Authors:** Xudong Xiang, Xinxin Mao, Tengfei Luo, Chenbin Liu, Bozhao Li, Pei Yu, Yu Zhang, Dai Wu, Yijing Wang, Qiao Zhou, Yixiao Zhu, Bin Li, Kun Xia, Guihu Zhao, Jinchen Li

PMC · DOI: 10.1093/bioinformatics/btag034 · Bioinformatics · 2026-01-19

## TL;DR

CNVSeeker is a new tool that helps doctors and scientists interpret genetic copy number variations from different sequencing methods to better diagnose diseases like autism.

## Contribution

CNVSeeker is a novel, one-stop pipeline for analyzing and interpreting CNVs from multiple sequencing technologies with high accuracy.

## Key findings

- CNVSeeker outperforms existing methods in CNV calling accuracy.
- It achieved a diagnostic yield of ~6.3% in identifying ASD-associated CNVs in 1946 individuals.
- The tool provides a user-friendly web interface for visualizing results.

## Abstract

DNA copy number variations (CNVs) exert a profound impact on major genetic disorders in humans. Although multiple sequencing technologies have become the first line of molecular diagnosis for CNVs, existing tools are unable to resolve the pathogenicity of CNVs directly from raw sequencing data.

We developed CNVSeeker, a one-stop and easy-to-use pipeline that provides comprehensive analysis from raw sequencing data to variant interpretation reports, and supports multiple types of sequencing data including short-read data such as whole genome sequencing data and whole exome sequencing data, and long-read sequencing data from Pacific Biosciences HiFi platform or Oxford Nanopore Technologies platform. Through extensive benchmarking, CNVSeeker demonstrated comparable enhancement over the state-of-the-art methods for CNV calling. Moreover, CNVSeeker enables significantly precise variant classification with an accuracy of ∼87%. By applying CNVSeeker to 1946 individuals with autism spectrum disorder (ASD), a total of 133 ASD-associated CNVs in 122 patients were identified, yielding a diagnostic yield of ∼6.3%. Additionally, we have also provided a user-friendly webserver for intuitive visualization of results. This study highlights the potential of CNVSeeker to benefit clinicians and geneticists with limited bioinformatic skill by aiding them interpret CNVs directly from various types of raw sequencing data for auxiliary disease diagnosis.

The web server is freely available at https://genemed.tech/cnvseeker and the open-source code can be found at https://github.com/lovelycatZ/CNVSeeker.

## Linked entities

- **Diseases:** autism spectrum disorder (MONDO:0005258)

## Full-text entities

- **Diseases:** genetic disorders (MESH:D030342), ASD (MESH:D000067877)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12918764/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12918764/full.md

## References

56 references — full list in the complete paper: https://tomesphere.com/paper/PMC12918764/full.md

---
Source: https://tomesphere.com/paper/PMC12918764