# ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets

**Authors:** Peizheng Mu, Xiangyan Feng, Lanxin Tong, Jie Huang, Chaoqun Zhu, Fei Wang, Wei Quan, Yuanjun Ma, Yucui Dong, Xiao Zhu

PMC · DOI: 10.1016/j.csbj.2025.06.045 · Computational and Structural Biotechnology Journal · 2025-06-29

## TL;DR

ASVBM is a new benchmarking framework for structural variant detection that improves accuracy by jointly analyzing multiple variants.

## Contribution

ASVBM introduces latent positives and a joint analysis strategy to better match variants across callsets.

## Key findings

- Multiple small variants can be equivalent to a larger variant, improving benchmarking accuracy.
- ASVBM reduces false mismatches by jointly analyzing local variants in callsets.
- ASVBM was evaluated using real WGS datasets and six state-of-the-art pipelines.

## Abstract

Accurate benchmarking of structural variant (SV) detection is essential for advancing the development and application of human whole-genome sequencing (WGS). A fundamental challenge in benchmarking SV detection results is determining whether two SVs represent the same event. Differences in the variation-awareness and strategic implementation of aligners inherently constrain SV detection algorithms that rely on alignment-based approaches. Traditional benchmarking, which primarily focuses on comparing and matching individual variants, makes it difficult to capture the relationships between multiple adjacent variants. We introduced ASVBM, an improved benchmarking framework that introduces the notion of latent positives and leverages a joint analysis and validation strategy based on local variants. This performance improvement arose from the discovery that multiple smaller variants are nearly equivalent to a larger variant. We comprehensively evaluated the performance of six state-of-the-art variant calling pipelines using real WGS datasets. According to multiple matching criteria, ASVBM employs a joint analysis strategy to uncover potential equivalences between the callset and the benchmark set, thereby reducing false mismatches caused by differences in variant representation. ASVBM is available at https://github.com/zhuxiao/asvbm.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12271604/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12271604/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12271604/full.md

---
Source: https://tomesphere.com/paper/PMC12271604