# parsomics: a data-driven framework for metagenomics data integration powered by a local relational database

**Authors:** Pedro Sader de Azevedo, Meiski Mariá Vedovatto, Pedro Coelho Gimenes de Freitas, Rafaela Beatriz Silva Luz, Rodrigo Silva Araujo Streit, Gabriela Felix Persinoti

PMC · DOI: 10.1093/bioadv/vbag049 · Bioinformatics Advances · 2026-02-15

## TL;DR

parsomics is a new tool for managing and integrating metagenomics data using a local relational database, making it easier to handle complex microbial community datasets.

## Contribution

parsomics introduces a lightweight, extensible framework for local storage and integration of metagenomics data using a relational database.

## Key findings

- parsomics uses PostgreSQL and Python to create a user-configured relational database for metagenomics data.
- The tool supports modular extensions and is designed for prokaryotic metagenomics analysis.
- It is open-source and available for major GNU/Linux environments.

## Abstract

Metagenomics enables the analysis of complex microbial communities directly from environmental samples, resulting in massive datasets that are processed using multiple tools and workflows. Data integration is key for metagenomics research, however, challenges in data organization and management locally remain open in existing workflows.

We present parsomics, a lightweight and extensible data management tool designed for efficient local storage, organization, and integration of metagenomic analysis results. Built upon PostgreSQL and implemented in Python, parsomics leverages a user-defined configuration file to automatically construct a relational database tailored to metagenomics-based data. It is user-friendly, easy to deploy, and implements modular plugin-based extensions to support diverse data types and outputs. parsomics can be installed in every major GNU/Linux environment and currently focuses on prokaryotic metagenomics analysis.

parsomics is an open-source project and its source code is available at https://gitlab.com/parsomics under the GPLv3 license. Comprehensive documentation can be found at https://parsomics.org and https://api.parsomics.org.

## Full-text entities

- **Chemicals:** carbohydrate (MESH:D002241)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13006488/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13006488/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC13006488/full.md

---
Source: https://tomesphere.com/paper/PMC13006488