# Multifaceted human gut microbiome data associated with health and nutrition

**Authors:** Lucia Maisto, Claudia Telegrafo, Francesco Rubino, Monica Santamaria, Maria H. Traka, Apollonia Tullo, Jildau Bouwman, Elisabetta Sbisà, Bachir Balech

PMC · DOI: 10.3389/fmicb.2026.1722500 · Frontiers in Microbiology · 2026-02-16

## TL;DR

This review discusses how gut microbiome data related to health and nutrition are scattered and need better standardization for improved research and therapies.

## Contribution

The paper proposes using FAIR data principles and semantic technologies to enhance microbiome data integration and reuse.

## Key findings

- Microbiome data are scattered across databases with varying curation and standardization.
- FAIR principles can improve data discovery and support new scientific hypotheses.
- Semantic classification and ontologies can enhance metadata enrichment and alignment.

## Abstract

The microbiome, also considered the hidden organ, is a fundamental ecosystem directly associated with the disease and health status of the human body. With the availability of high-throughput DNA sequencing technologies, a growing number of studies from clinical and experimental (observation and intervention) samples are constantly revealing new findings on the relationship between human organs and their microbiomes. In such a context, diet and nutrition are among the key factors influencing microbiome composition, richness, and functional behavior. In this review, we illustrate how microbiome-related data and associated metadata are in recent times scattered across primary and specialized databases with different levels of curation, annotation, and standardization, limiting, to some extent, the possibility of deep data discovery, reuse, alignment, and harmonization. Therefore, we describe the way Findable, Accessible, Interoperable, and Reusable (FAIR) data principles would enhance the onset of novel scientific hypotheses and potential microbiome-targeted therapies by improving the standardization policies in data sources. Accordingly, using advanced semantic classification and data mining technologies based on suitable and comprehensive ontologies, annotations of studies present in source databases or in scientific literature would further improve the data and metadata enrichment, integration and alignment relevant to microbiome data associated with health, disease and nutrition.

## Full-text entities

- **Diseases:** neurological development disorders (MESH:D002658), Chronic intestinal disorders (MESH:D007410), glucose intolerance (MESH:D018149), type 2 diabetes (MESH:D003924), systemic (MESH:D015619), immune dysfunctions (MESH:D007154), cardiovascular diseases (MESH:D002318), insulin resistance (MESH:D007333), colon and colorectal cancer (MESH:D015179), viral infections (MESH:D014777), autoimmune diseases (MESH:D001327), obese (MESH:D009765), tumorigenesis (MESH:D063646), diabetic (MESH:D003920), dysbiosis (MESH:D064806), inflammation (MESH:D007249)
- **Chemicals:** SCFA (MESH:D005232), glucose (MESH:D005947), lipid (MESH:D008055), butyrate (MESH:D002087), carbohydrates (MESH:D002241), reuterin (MESH:C047158)
- **Species:** gut metagenome (species) [taxon 749906], Limosilactobacillus reuteri (species) [taxon 1598], Enterobacteriaceae (enterobacteria, family) [taxon 543], Bacillota (clostridial firmicutes, phylum) [taxon 1239], Bacteroides fragilis (species) [taxon 817], Homo sapiens (human, species) [taxon 9606], human metagenome (species) [taxon 646099], Escherichia coli (E. coli, species) [taxon 562]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12950768/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12950768/full.md

## References

67 references — full list in the complete paper: https://tomesphere.com/paper/PMC12950768/full.md

---
Source: https://tomesphere.com/paper/PMC12950768