# DoBSeqWF: a framework for sensitive detection of individual genetic variation in pooled sequencing data

**Authors:** Mads Cort Nielsen, Christian Munch Hagen, Ulrik Kristoffer Stoltze, Thomas van Overeem Hansen, Mette Nyegaard, Henrik Hjalgrim, Marie Bækvad-Hansen, Anna Byrjalsen, Kjeld Schmiegelow, Karin Wadt, Jonas Bybjerg-Grauholm, Simon Rasmussen

PMC · DOI: 10.1093/nargab/lqag021 · 2026-02-16

## TL;DR

DoBSeqWF is a new pipeline that improves the detection of rare genetic variants in cost-effective pooled sequencing data, helping with early diagnosis of genetic diseases.

## Contribution

DoBSeqWF introduces a specialized Nextflow-based workflow with machine learning filters for accurate rare variant detection in double-batched sequencing data.

## Key findings

- DoBSeqWF accurately detects rare variants in pooled sequencing data with high sensitivity.
- Machine learning filters improve variant detection while maintaining scalability.
- The pipeline was validated using a childhood cancer cohort with whole genome sequencing as a reference.

## Abstract

Population screening for rare genetic diseases has the potential to increase early diagnosis and treatment, but the high cost of next-generation sequencing limits widespread implementation. Double-batched sequencing (DoBSeq) is a cost-effective method that uses two-dimensional overlapping pool sequencing to enable individual-level rare variant detection. However, the resulting high-depth, complex data require a specialized workflow for efficient, sensitive, and reproducible analysis. We developed DoBSeqWF (DoBSeq Workflow), a Nextflow-based pipeline that processes pooled sequencing data from alignment through variant calling, filtering, and final variant assignment. Using a childhood cancer cohort of 200 individuals with whole genome sequencing as a reference, we created training and validation datasets, benchmarked multiple variant callers, and implemented machine learning filters to improve rare variant detection while maintaining high sensitivity. DoBSeqWF demonstrates accurate and scalable rare variant detection within the evaluated experimental setting and provides a promising avenue for future cost-effective genetic screening programmes.

## Linked entities

- **Diseases:** childhood cancer (MONDO:0006517)

## Full-text entities

- **Diseases:** genetic diseases (MESH:D030342), cancer (MESH:D009369)

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12907731/full.md

---
Source: https://tomesphere.com/paper/PMC12907731