# Doblin: inferring dominant clonal lineages from high-resolution DNA barcoding time series

**Authors:** Melis Gencel, David Gagné-Leroux, Adrian W R Serohijos

PMC · DOI: 10.1093/bioinformatics/btaf555 · 2025-10-06

## TL;DR

Doblin is a tool that identifies dominant cell lineages in populations using DNA barcoding data, helping track evolutionary dynamics and competition.

## Contribution

Introduces Doblin, a new R package for analyzing clonal lineage dynamics from high-resolution DNA barcoding time series.

## Key findings

- Doblin accurately recovers clone identities and relative fitness from simulated lineage data.
- Doblin successfully analyzed clonal dynamics in E. coli under antibiotic treatment and gut microbial colonization.
- The tool is versatile and applicable across various experimental setups involving lineage tracking.

## Abstract

The lineage dynamics and history of cells in a population reflect the interplay of evolutionary forces they experience, including mutation, drift, and selection. When the population is polyclonal, lineage dynamics also manifest the extent of clonal competition among co-existing mutational variants. If the population exists in a community of other species, the lineage dynamics could also reflect the population’s ecological interaction with the rest of the community. Recent advances in high-resolution lineage tracking via DNA barcoding, coupled with next-generation sequencing of bacteria, yeast, and mammalian cells, allow for precise quantification of clonal dynamics in these organisms.

In this work, we introduce Doblin, an R suite for identifying dominant barcode lineages based on high-resolution lineage tracking data. We first benchmarked Doblin’s accuracy using lineage data from evolutionary simulations, showing that it recovers the clones’ identity and relative fitness in the simulation. Next, we applied Doblin to analyze clonal dynamics in laboratory evolutions of Escherichia coli populations undergoing antibiotic treatment and in colonization experiments of the gut microbial community. Doblin’s versatility allows it to be applied to lineage time-series data across different experimental setups.

Doblin is available on CRAN (https://CRAN.R-project.org/package=doblin) and Github (https://github.com/dagagf/doblin).

## Linked entities

- **Species:** Escherichia coli (taxon 562)

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Escherichia coli (E. coli, species) [taxon 562], Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395]

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12553330/full.md

---
Source: https://tomesphere.com/paper/PMC12553330