# MoGAAAP: a modular Snakemake workflow for automated genome assembly and annotation with quality assessment

**Authors:** Dirk-Jan M van Workum, Kuntal K Dey, Alexander Kozik, Dean O Lavelle, Dick de Ridder, M Eric Schranz, Richard W Michelmore, Sandra Smit

PMC · DOI: 10.1093/nargab/lqag008 · 2026-01-22

## TL;DR

MoGAAAP is a pipeline that automates genome assembly and annotation using various sequencing data, providing quality assessments for comparative genomics.

## Contribution

The novelty lies in a modular, automated pipeline for genome assembly and annotation with comprehensive quality assessment.

## Key findings

- The pipeline is species-agnostic and supports HiFi, ONT, and Hi-C reads.
- It generates detailed quality reports for assembly and annotation.
- The pipeline is implemented in Snakemake and is publicly available for use.

## Abstract

With the current speed of sequencing, there is a desire for standardized and automated genome assembly and annotation to produce high-quality genomes as input for comparative (pan)genomics. Therefore, we created a convenience pipeline using existing tools that creates annotated genome assemblies from HiFi (and optionally ultra-long ONT and/or Hi-C) reads for a set of related individuals as well as a related reference genome. Our pipeline is species-agnostic and generates an extensive quality assessment report that can be used for manual filtering and refinement of the assembly and annotation. It includes statistics for individual completeness and contamination assessments as well as a concise pangenome view. The pipeline is implemented in Snakemake and available with a GPLv3 licence at GitHub under github.com/dirkjanvw/MoGAAAP, at Zenodo under doi.org/10.5281/zenodo.14833021, and can be installed through Bioconda.

## Full-text entities

- **Chemicals:** 4TU (-)
- **Species:** Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Lactuca serriola (compass-plant, species) [taxon 75943], Homo sapiens (human, species) [taxon 9606], Lactuca sativa (cultivated lettuce, species) [taxon 4236]

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12824462/full.md

---
Source: https://tomesphere.com/paper/PMC12824462