# Pipeasm: a tool for automated large chromosome-scale genome assembly and evaluation

**Authors:** Bruno Marques Silva, Fernanda de Jesus Trindade, Lucas Eduardo Costa Canesin, Giordano Souza, Alexandre Aleixo, Gisele Nunes, Renato Renison Moreira-Oliveira

PMC · DOI: 10.1093/bioadv/vbaf326 · Bioinformatics Advances · 2026-01-02

## TL;DR

Pipeasm is an automated pipeline for assembling and evaluating large genomes using multiple data types, producing high-quality chromosome-scale assemblies efficiently.

## Contribution

Pipeasm introduces a reproducible and automated genome assembly workflow that integrates multiple sequencing technologies and evaluation steps.

## Key findings

- Pipeasm produced assemblies with scaffold L50 matching expected chromosome lengths and up to 99.6% BUSCO completeness.
- The pipeline successfully handled four vertebrate species with distinct genomic characteristics.
- Detailed reports and Hi-C maps are generated automatically for downstream analysis and curation.

## Abstract

Although high-quality chromosome-scale genome assemblies are feasible, assembling large ones remains complex and resource-intensive. This demands reproducible and automated workflows that not only implement current best practices efficiently but also allow for improvement alongside future updates to those standards.

We present Pipeasm, a Snakemake-based genome assembly pipeline containerized with Singularity. Pipeasm can use HiFi, ONT, and Hi-C data, automating read trimming, nuclear and mitogenome assembly, scaffolding, decontamination, and quality evaluation. Applied to four vertebrate species with distinct genomic characteristics, starting from a single command line and configuration file, it produced assemblies with scaffold L50 proportional to the expected chromosome and genome length, and up to 99.6% BUSCO completeness. Its output also includes detailed reports for each step, genome statistics, Hi-C maps, and files ready for curation.

Pipeasm is available at https://github.com/itvgenomics/pipeasm, implemented in Python/Snakemake with Singularity, and runs on Unix-based systems.

## Full-text entities

- **Chemicals:** RAM (MESH:C071315)
- **Species:** Gallus gallus (bantam, species) [taxon 9031], Pseudophryne corroboree (coroboree toadlet, species) [taxon 495146], Taeniopygia guttata (zebra finch, species) [taxon 59729], Elephas maximus (Asian elephant, species) [taxon 9783], Homo sapiens (human, species) [taxon 9606], Gallus (genus) [taxon 9030]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12800776/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12800776/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/PMC12800776/full.md

---
Source: https://tomesphere.com/paper/PMC12800776