# PRIME: a database for 16S rRNA microbiome data with phenotypic reference and comprehensive metadata

**Authors:** Zhizhuo Zhang, Hongyu Zhao, Tao Wang

PMC · DOI: 10.1093/nar/gkaf1057 · Nucleic Acids Research · 2025-10-31

## TL;DR

PRIME is a comprehensive database of human microbiome data with standardized metadata and taxonomic profiles, enabling cross-study analysis and phenotype-based research.

## Contribution

PRIME introduces a curated, harmonized database of 16S rRNA microbiome data with extensive metadata and standardized taxonomic profiles for reproducible and integrative analysis.

## Key findings

- PRIME aggregates 53,449 samples from 111 studies with harmonized metadata across 93 body sites and 101 phenotypic categories.
- The database provides taxonomic abundance profiles using SILVA and Greengenes2 reference databases at multiple taxonomic levels.
- PRIME supports interactive exploration and programmatic access via a web interface, APIs, and an R package.

## Abstract

PRIME (Phenotypic Reference for Integrated Microbiome Enrichment) is a curated and standardized database of human microbiome 16S rRNA amplicon sequencing data, designed to facilitate cross-study analysis, reproducibility, and phenotype-driven discovery. PRIME aggregates 53 449 samples from 111 public studies, covering 93 body sites and 101 phenotypic categories, with detailed harmonization of sample-level metadata such as disease status, demographics, body sites, sequencing protocols, and experimental design. Each sample includes taxonomic abundance profiles generated via a consistent pipeline using both SILVA (138.2) and Greengenes2 (2024.09) reference databases, with results reported at multiple taxonomic levels as observed abundances (read counts) and relative abundances (proportions). A major strength of PRIME is its extensive manual curation, which standardizes phenotypic and contextual metadata across studies, enabling precise querying and robust phenotype-based comparisons. Users can interactively explore the database through a modern web interface, filter and visualize data by metadata fields, and download customized subsets. Programmatic access is supported via RESTful APIs and R package. PRIME aims to advance microbiome data integration and is continuously updated to incorporate new studies and features. The database is freely available at https://primedb.sjtu.edu.cn.

Graphical Abstract

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12807763/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12807763/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12807763/full.md

---
Source: https://tomesphere.com/paper/PMC12807763