# Atlantool: a command line tool to retrieve DNA and RNA sequencing reads from BAM files by the read identifier

**Authors:** Emma M Rath, Huy Le, Yukhym Pyshnohraiev, Amitdev Ranjitdev, Robin Stocker, Alex Yakovlev, Sally L Dunwoodie, David S Winlaw, Eleni Giannoulatou, Natasha Nassar, Edwin Kirk, Gavin Chapman, Gillian Blue, Gary Sholler, Samantha Lain

PMC · DOI: 10.1093/bioadv/vbaf226 · Bioinformatics Advances · 2025-09-24

## TL;DR

Atlantool is a command-line tool that, after a one-time index build, quickly retrieves DNA or RNA sequencing reads from BAM files by read identifier.

## Contribution

Atlantool provides a reliable, efficient way to retrieve sequencing reads from BAM files by read identifier, a capability the authors state was previously lacking.

## Key findings

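- After a one-time creation of a read identifier index, retrieval of reads by identifier appears to be instantaneous.
- The tool supports reads of any length and is invoked with a simple command similar to those of SAMtools.
- Precompiled executables are freely available for Linux, macOS, and Windows, along with a Java JAR file for running Atlantool in a Java environment.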

## Abstract

DNA or RNA sequencing produces a large volume of data that is usually stored in the Binary Alignment Map (BAM) file format. Processing and analysing these large genomic data sets requires specialized software tools. Most processing involves accessing DNA or RNA data by chromosomal coordinates using SAMtools or similar software. However, challenges arise when the data must be accessed by sequencing read identifier, and as yet there is no reliable, efficient method or tool to do this. Here we present Atlantool, fast software that retrieves sequencing reads from a BAM file by the read identifier. Retrieval of sequencing reads using Atlantool requires a simple command similar to those of SAMtools. After a one-time creation of a read identifier index in the same BGZF format as BAM files, retrieval of data by the read identifier appears to be instantaneous. The sequencing reads can be of any length. Atlantool fills the existing need for a reliable tool to efficiently retrieve specific records from high-volume DNA or RNA sequencing data and will enable new genomic analyses to be envisaged and carried out.

Precompiled Atlantool executables are freely available for download from https://github.com/VCCRI/atlantool/releases for Linux, macOS, and Windows platforms, as is a Java JAR file that permits Atlantool to run in a Java environment. The source code and user documentation are available at https://github.com/VCCRI/atlantool/.
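The abstract's pattern of a one-time index build followed by fast lookup by read identifier can be illustrated with a small in-memory sketch. This is not Atlantool's implementation: the abstract says only that the index is stored in BGZF format, so the sorted-list layout, the function names `build_index` and `lookup`, and the use of list positions in place of file offsets are all assumptions made for illustration.

```python
import bisect


def build_index(read_names):
    """One-time pass: pair every read name with the position of its record.

    `read_names` lists the read identifiers in file order. The position is a
    stand-in (assumption) for the file offset a real BAM index would store.
    """
    # Sorting the (name, position) pairs lets lookups use binary search.
    return sorted((name, pos) for pos, name in enumerate(read_names))


def lookup(index, name):
    """Return the positions of every record whose read name equals `name`."""
    # The 1-tuple (name,) sorts before any (name, pos), so bisect_left
    # lands on the first entry for this name, if there is one.
    i = bisect.bisect_left(index, (name,))
    hits = []
    while i < len(index) and index[i][0] == name:
        hits.append(index[i][1])
        i += 1
    return hits


# Hypothetical read names; paired-end mates share one identifier.
names = ["read7", "read2", "read7", "read5"]
index = build_index(names)
print(lookup(index, "read7"))
```

Building the index costs one sort over all records, after which each lookup takes a logarithmic number of comparisons rather than a scan of the whole file, which is the trade-off the abstract describes.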

## Full-text entities

- **Diseases:** Cardiovascular (MESH:D002318), Congenital Heart Disease (MESH:D006330)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12602190/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12602190/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/PMC12602190/full.md

---
Source: https://tomesphere.com/paper/PMC12602190