# MHASS: Microbiome HiFi Amplicon Sequencing Simulator

**Authors:** Rye Howard-Stone, Ion I Măndoiu

PMC · DOI: 10.1093/bioinformatics/btaf656 · Bioinformatics · 2025-12-06

## TL;DR

MHASS is a tool that generates realistic synthetic sequencing data for microbiome studies to improve analysis workflows.

## Contribution

MHASS introduces a genome-aware method for simulating HiFi amplicon data with realistic barcoding and sequencing characteristics.

## Key findings

- MHASS integrates abundance modeling and pass-number distributions from real sequencing runs.
- The tool supports benchmarking of long-read microbiome analysis workflows like ASV clustering.
- MHASS is freely available with installation instructions and evaluation data.

## Abstract

Microbiome HiFi Amplicon Sequence Simulator (MHASS) creates realistic synthetic PacBio HiFi amplicon sequencing datasets for microbiome studies, by integrating genome-aware abundance modeling, realistic dual-barcoding strategies, and empirically derived pass-number distributions from actual sequencing runs. MHASS generates datasets tailored for rigorous benchmarking and validation of long-read microbiome analysis workflows, including ASV clustering and taxonomic assignment.

Implemented in Python with automated dependency management, the source code for MHASS is freely available at https://github.com/rhowardstone/MHASS along with installation instructions. Our code is also published on Zenodo at https://doi.org/10.5281/zenodo.17486364. The data underlying this article are available on GitHub at https://github.com/rhowardstone/MHASS_evaluation/.

## Full-text entities

- **Diseases:** MHASS (MESH:C565484)
- **Chemicals:** Titan-1 (-)
- **Species:** Escherichia coli (E. coli, species) [taxon 562], Pseudomonas aeruginosa (species) [taxon 287], Salmonella enterica (species) [taxon 28901]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12790812/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12790812/full.md

## References

10 references — full list in the complete paper: https://tomesphere.com/paper/PMC12790812/full.md

---
Source: https://tomesphere.com/paper/PMC12790812