CLEVER: Clique-Enumerating Variant Finder
Tobias Marschall, Ivan Costa, Stefan Canzar, Markus Bauer, Gunnar, Klau, Alexander Schliep, Alexander Sch\"onhuth

TL;DR
CLEVER introduces a novel graph-based algorithm for detecting structural variants, especially indels of 20-100 base pairs, outperforming existing methods on simulated and real sequencing data.
Contribution
It presents a new clique-enumeration approach for structural variant detection that improves accuracy for challenging indel sizes and compares favorably with state-of-the-art methods.
Findings
Superior detection of 20-100 bp indels compared to existing methods.
Effective on both simulated and real sequencing data.
Provides a comprehensive comparison with other approaches.
Abstract
Next-generation sequencing techniques have facilitated a large scale analysis of human genetic variation. Despite the advances in sequencing speeds, the computational discovery of structural variants is not yet standard. It is likely that many variants have remained undiscovered in most sequenced individuals. Here we present a novel internal segment size based approach, which organizes all, including also concordant reads into a read alignment graph where max-cliques represent maximal contradiction-free groups of alignments. A specifically engineered algorithm then enumerates all max-cliques and statistically evaluates them for their potential to reflect insertions or deletions (indels). For the first time in the literature, we compare a large range of state-of-the-art approaches using simulated Illumina reads from a fully annotated genome and present various relevant performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genomics and Rare Diseases · Genomic variations and chromosomal abnormalities
