# ABaCo: addressing heterogeneity challenges in metagenomic data integration with adversarial generative models

**Authors:** Edir Vidal, Angel L Phanthanourak, Atieh Gharib, Henry Webel, Juliana Assis, Sebastián Ayala-Ruano, André F Cunha, Alberto Santos

PMC · DOI: 10.1093/nar/gkag227 · Nucleic Acids Research · 2026-03-17

## TL;DR

ABaCo is a new method that uses generative models to integrate diverse metagenomic datasets, improving accuracy and preserving biological signals.

## Contribution

ABaCo introduces a novel adversarial generative model for metagenomic data integration that outperforms existing methods.

## Key findings

- ABaCo effectively integrates metagenomic data from multiple studies.
- The model corrects technical heterogeneity while preserving taxonomic-level biological signals.
- ABaCo outperforms existing methods in metagenomic data integration.

## Abstract

The rapid advancement of high-throughput metagenomics has produced extensive and heterogeneous datasets with significant implications for environmental and human health. Integrating these datasets is crucial for understanding the functional roles of microbiomes and the interactions within microbial communities. However, this integration remains challenging due to technical heterogeneity and the inherent complexity of these biological systems. To address these challenges, we introduce ABaCo, a generative model that combines a variational autoencoder with an adversarial discriminator specifically designed to handle the unique characteristics of metagenomic data. Our results demonstrate that ABaCo effectively integrates metagenomic data from multiple studies, corrects technical heterogeneity, outperforms existing methods, and preserves taxonomic-level biological signals. We have developed ABaCo as an open-source, fully documented Python library to facilitate, support and enhance metagenomics research in the scientific community.

Graphical Abstract

## Full-text entities

- **Diseases:** IBD (MESH:D015212), Ulcerative Colitis (MESH:D003093), Crohn Disease (MESH:D003424), ARI (MESH:D000275)
- **Chemicals:** ABaCo (-), phenol (MESH:D019800)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** V100S

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12993454/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12993454/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12993454/full.md

---
Source: https://tomesphere.com/paper/PMC12993454