# esloco: simulation-based estimation of local coverage in long-read DNA sequencing

**Authors:** Adrian Weich, Christopher Lischer, Julio Vera

PMC · DOI: 10.1093/bioinformatics/btag009 · Bioinformatics · 2026-01-09

## TL;DR

esloco is a simulation tool that helps predict DNA sequencing coverage in long-read experiments, reducing the need for costly pilot studies.

## Contribution

esloco introduces a Monte Carlo-based framework for estimating local coverage in long-read sequencing, even for unknown target regions.

## Key findings

- esloco accurately predicts local coverage based on sequencing depth and read length distribution.
- Benchmarking on a 45-gene panel showed strong agreement with real sequencing data.
- The tool supports PCR-free and unknown target region scenarios like viral integration and CRISPR-Cas9.

## Abstract

Long-read DNA sequencing is increasingly applied for whole-genome studies, yet experimental planning often lacks reliable estimates of target region coverage, leading to costly and time-consuming pilot studies and replicates. We present esloco, a Monte Carlo-based simulation framework for estimating local coverage in long-read sequencing experiments, including scenarios with unknown target regions (e.g. viral integration, CRISPR-Cas9) or PCR-free designs (e.g. base modifications). By modeling coverage as a function of sequencing depth and read length distribution, esloco enables informed predictions of local sequencing outcomes. Benchmarking across a 45-gene panel demonstrated close agreement with empirical data, underscoring the framework’s reliability.

esloco is a Python package available on PyPI (https://pypi.org/project/esloco/), GitHub (https://github.com/aweich/esloco), and Zenodo (https://doi.org/10.5281/zenodo.17776161).

## Full-text entities

- **Genes:** GBA1 (glucosylceramidase beta 1) [NCBI Gene 2629] {aka GBA, GCB, GLUC}, ATP1A3 (ATPase Na+/K+ transporting subunit alpha 3) [NCBI Gene 478] {aka AHC2, CAPOS, DEE99, DYT12, RDP}, LRRK2 (leucine rich repeat kinase 2) [NCBI Gene 120892] {aka AURA17, DARDARIN, PARK8, RIPK7, ROCO2}, SLC6A3 (solute carrier family 6 member 3) [NCBI Gene 6531] {aka DAT, DAT1, PKDYS, PKDYS1}, PLA2G6 (phospholipase A2 group VI) [NCBI Gene 8398] {aka CaI-PLA2, GVI, INAD1, IPLA2-VIA, NBIA2, NBIA2A}, WDR45 (WD repeat domain 45) [NCBI Gene 11152] {aka JM5, NBIA4, NBIA5, WDRX1, WIPI-4, WIPI4}, RAB39B (RAB39B, member RAS oncogene family) [NCBI Gene 116442] {aka BGMR, MRX72, WSMN, WSN, XLID72}, ATXN1 (ataxin 1) [NCBI Gene 6310] {aka ATX1, D6S504E, SCA1}, MAPT (microtubule associated protein tau) [NCBI Gene 4137] {aka DDPAC, FTD1, FTDP-17, MAPTL, MSTD, MTBT1}
- **Diseases:** Parkinson's disease (MESH:D010300)
- **Chemicals:** ONT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12866673/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12866673/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/PMC12866673/full.md

---
Source: https://tomesphere.com/paper/PMC12866673