# GutMIND: A multi-cohort machine learning framework for integrative characteristics of the microbiota-gut-brain axis in neuropsychiatric disorders

**Authors:** Yanmei Ju, Shutian Lin, Shaohua Hu, Xin Jin, Liang Xiao, Tao Zhang, Yudan Zhang, Liping Zhang, Xiancang Ma, Feng Zhu, Ruijin Guo

PMC · DOI: 10.1080/19490976.2026.2630563 · Gut Microbes · 2026-02-16

## TL;DR

This paper introduces GutMIND, a large database and machine learning tools to study the gut-brain connection in neuropsychiatric disorders using microbiome data from 3,492 individuals across 12 countries.

## Contribution

The novel contribution is the creation of the GutMIND database and the MetaClassifier framework for diagnosing neuropsychiatric disorders using gut microbiome data.

## Key findings

- Microbial community heterogeneity was significantly higher in patients with neuropsychiatric disorders compared to healthy controls.
- MetaClassifier achieved a mean AUROC of 0.69 in diagnosing 8 disorders using gut microbiome data.
- Nine core protective microbiota species were identified, linked to glutamate synthesis and acetate production.

## Abstract

Emerging evidence underscores bidirectional communication along the microbiota-gut-brain axis in neuropsychiatric disorders. However, the field lacks dedicated metagenomic resources with standardized phenotyping for these conditions. Existing single-cohort studies face inherent limitations due to restricted sample sizes, confounding heterogeneity, and methodological fragmentation, compromising reproducibility and mechanistic insights. To overcome these challenges, we constructed the Gut Microbiome in Multinational Integrated Neuropsychiatric Disorders (GutMIND) database, a comprehensive resource integrating shotgun metagenomic data with harmonized metadata. Adhering to a standardized preprocessing protocol and rigorous quality control workflow, this dataset represents the largest gut-brain microbiome repository to date, encompassing 31 studies across 12 countries (n = 3,492) spanning 14 neuropsychiatric conditions. Utilizing this dataset, we characterized microbial community heterogeneity, which was significantly elevated in patients compared to healthy controls. Subsequently, we developed a computational framework, MetaClassifier, enabling the diagnosis of neuropsychiatric disorders and the identification of microbial biomarkers. Employing a comprehensive two-stage validation strategy, we first assessed the model utilizing taxonomic abundance profiles via nested cross-validation in the high-quality discovery cohort (n = 2,734), achieving a mean AUROC of 0.69 (range: 0.55–0.78) across 8 disorders. Its robustness was further confirmed in an independent platform-extended validation cohort (n = 400), yielding a mean AUROC of 0.71 (range: 0.60–0.76). We also developed the Microbial Gut-Brain Axis Health Index (MGBA-HI), which effectively distinguished neuropsychiatric status in both the high-quality cohort and the platform-extended cohort. Furthermore, integrative analysis of health-abundant species, index-derived biomarkers, and ecological prevalence, we identified 9 core neuropsychiatric-protective microbiota. These species predominantly exhibited metabolic capacities linked to glutamate synthesis and acetate production. Building upon this, the GutMIND framework ensures robust cross-cohort comparability while minimizing technical heterogeneity, thereby enhancing inferential rigor in gut microbiome-neuropsychiatry research. Notably, the MetaClassifier, MGBA-HI, and core microbiota hold translational potential for developing microbiome-based prognostic tools and personalized therapeutic strategies in neuropsychiatric disorders. The source code and usage instructions for MetaClassifier are accessible at https://github.com/juyanmei/MetaClassifier.

## Linked entities

- **Chemicals:** glutamate (PubChem CID 611), acetate (PubChem CID 175)

## Full-text entities

- **Genes:** CECR (cat eye syndrome chromosome region) [NCBI Gene 1055] {aka CES}
- **Diseases:** anorexia nervosa (MESH:D000856), BD (MESH:D001714), OCD (MESH:D009771), DD (MESH:D003866), SAD (MESH:D000072861), developmental disorders (MESH:D002658), mental disease (MESH:D008607), gut-brain axis disorders (MESH:D001927), III (MESH:C537189), neuropsychiatric health (OMIM:603663), neuropsychiatric (MESH:C000631768), obesity (MESH:D009765), GutMIND (MESH:D000081042), neuropsychiatric status (MESH:D013226), motor deficits (MESH:D009461), MDD (MESH:D003865), PD (MESH:D010300), ASD (MESH:D000067877), GBM (MESH:D005910), neurodegenerative conditions (MESH:D019636), SCZ (MESH:D012559), behavioral deficits (MESH:D019958), AD (MESH:D000544), Neuropsychiatric Disorders (MESH:D001523)
- **Chemicals:** SCFA (MESH:D005232), adenosylcobalamin (MESH:C000913), acetate (MESH:D000085), KYNA (-), butyrate (MESH:D002087), S-adenosylmethionine (MESH:D012436), glutamate (MESH:D018698), GABA (MESH:D005680)
- **Species:** Bacillota (clostridial firmicutes, phylum) [taxon 1239], Ruthenibacterium lactatiformans (species) [taxon 1550024], gut metagenome (species) [taxon 749906], Faecalibacterium prausnitzii (species) [taxon 853], Homo sapiens (human, species) [taxon 9606], Bacteroides (genus) [taxon 816], Bifidobacterium longum (species) [taxon 216816], Phocaeicola (genus) [taxon 909656]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12915850/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12915850/full.md

## References

65 references — full list in the complete paper: https://tomesphere.com/paper/PMC12915850/full.md

---
Source: https://tomesphere.com/paper/PMC12915850