Study on Association Between Gut Microbiota, Serum Metabolism and Gestational Diabetes Mellitus Based on Metagenomic and Metabolomics Analysis
Wenduo Yu, Kun Tang, Rongjing An, Sujuan Ma, Hongzhuan Tan, Mengshi Chen

TL;DR
This study found that women who later develop gestational diabetes have different gut bacteria and blood metabolites early in pregnancy.
Contribution
The study identifies specific gut microbes and metabolic signatures in early pregnancy linked to later gestational diabetes.
Findings
Women who developed GDM had lower gut microbial diversity and higher beta diversity in early pregnancy.
Eleven microbial species and 64 plasma metabolites were found to differ between GDM and non-GDM groups.
Certain microbes correlated with metabolic indicators like triglycerides and cholesterol.
Abstract
Background/Objectives: This study aimed to explore the association between maternal gut microbiota and metabolic profiles in the first trimester and the subsequent risk of gestational diabetes mellitus (GDM), as well as to characterize association patterns linking gut microbiota, serum metabolites, and metabolic traits. Methods: A nested case–control study was conducted among women with GDM (n = 47) and those without GDM (n = 94). Metagenomic sequencing was applied to analyze fecal microbiota, and liquid chromatography–mass spectrometry (LC–MS) was used for non-targeted plasma metabolomics. Differential microbiota and metabolites between groups were identified, and correlation analyses were conducted to assess their associations with clinical indicators. Results: Women who later developed GDM showed lower alpha diversity and higher beta diversity. Eleven differential species were…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2- —Natural Science Foundation of Hunan Province, China
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGut microbiota and health · Gestational Diabetes Research and Management · Reproductive tract infections research
1. Introduction
In recent decades, the incidence of gestational diabetes mellitus (GDM) has continuously increased [1], seriously affecting the health of both mothers and infants [2,3,4].
It has been well confirmed that the gut microbiota can produce a variety of compounds that modulate the activity of the distal organs and also play a role in the development of GDM [5,6]. Alteration in the diversity of bacterial species and the abundance of specific bacterial taxa have been observed in GDM women [7,8,9,10,11,12,13,14,15,16]. The main dysbiosis of gut microbiota in pregnant women with GDM is the abundance of Firmicutes (or Bacillota) and Bacteroidetes (or Bacteroidota) or changes in F/B ratio [8,9,10]; an increase in phyla Firmicutes [11,12,13,14] or F/B ratio [14,15,16,17] was observed in women with GDM. However, a decrease in phyla Firmicutes, or reduced F/B ratio (mainly reduced class Clostridia), in the T2D patients was also observed [14,15,16,17,18,19].
Alternated composition of other phyla, such as Proteobacteria [17], Pseudomonadota [20], Actinobacteria [14,17], Proteobacteria [17], Verrucomicrobia [16,17], and Fusobacteria [17], have been observed to be associated with GDM. Genera such as Actinomyces [16,21], Bacteroides [14,16,22,23,24,25], Blautia [9,12,26,27], Streptococcus [7], and Prevotella [11,16] were predominantly enriched in the gut microbiome of women with GDM, while genera like Bifidobacterium [24,26,28] were relatively depleted. Numerous studies have identified significant differences in the abundance of many species. Among them, Bacteroides massiliensis [29] and Parabacteroides distasonis [6] were enriched in patients with GDM. Additionally, Clostridium_spiroforme, Eubacterium_dolichum, and Ruminococcus_gnavus were positively correlated with fasting blood glucose (FBG) levels [10].
Anaerostipes hadrus were associated with an impaired glucose tolerance [22]. Several species depleted in patients with GDM were observed, including Ruminococcus and Alistipes, putredinis, Bifidobacterium dentium [7,13,16,29], and Pyramidobacter piscolens [10], compared with normal pregnant women.
During pregnancy, particularly in the mid- and late-term, women experience a series of metabolic changes, such as increased adiposity and decrease in insulin sensitivity, which leads to an increased risk of diabetes in women in the third trimester. Results of a recent meta-analysis revealed that 67 metabolites differed between patients with GDM and controls. Among them, 25 metabolites were significantly associated with GDM risk, reflecting disruptions in multiple metabolic pathways, including amino acid, lipid, carbohydrate, and energy metabolism. Specifically, elevated levels of isoleucine, leucine, C16:0, glucose, pyruvate, lactate and other metabolites were linked to an increased risk of GDM, whereas higher concentrations of glutamine, histidine, and related metabolites were associated with a reduced risk [30]. Ye et al. [31] reported that, compared with the control group, patients with GDM had an increased abundance of 2-hydroxybutyric acid and L-alpha-aminobutyric acid, but a decrease in methionine sulfoxide, allantoin, and dopamine and dopaminergic synapse. Previous studies found that disturbances in phenylalanine and tyrosine metabolism may serve as potential biomarkers for GDM [29,30,32,33]. Sun et al. [29] further demonstrated that alterations in gut microbiome composition and microbial metabolic pathways related to carbohydrate and lipid metabolism were associated with GDM during pregnancy.
Research has found that women with GDM have a higher abundance of carbohydrate metabolism-related gut microbiota compared with control women [24], such as Ruminococcaceae, Parabacteroides distasonis, and Prevotella, which are believed to be associated with metabolic pathways for carbohydrate metabolism and insulin signaling [21]. In women with GDM combined with hyperlipidemia, the relative abundances of Streptococcus, Faecalibacterium, Prevotella, Haemophilus, and Actinomyces were significantly increased, and these bacterial abundances were positively correlated with elevated total cholesterol levels [16]. Additionally, patients with GDM with obesity exhibited pronounced intestinal microbiota dysbiosis [34]. These findings suggest that gut microbiota dysbiosis may be associated with abnormal carbohydrate and lipid metabolism, and could play a role in the pathogenesis of GDM.
More and more observational studies have reported that the characteristics of gut microbiome and metabolites in patients with GDM are significantly altered; however, the specific microbial taxa and metabolic features identified vary across studies and, in some cases, show opposing trends. Increasing evidence suggests that variations in gut microbiota and metabolic profiles can emerge early in pregnancy due to intrinsic host metabolic and inflammatory states rather than dietary differences alone. Such early biological heterogeneity may predispose women to altered glucose regulation, providing a plausible explanation for why first-trimester microbiota–metabolite signatures are detectable before the clinical onset of GDM. However, most existing evidence is derived from observational studies, and the biological mechanisms underlying the association between gut microbiome alterations and the development of GDM remain incompletely understood. The inconsistencies reported across studies are likely attributable to methodological heterogeneity, including differences in study populations, sample sizes, and study designs [35,36]. Moreover, the analysis of gut microbiota or metabolites alone cannot determine which gut microbiota play a key role in the occurrence of GDM, because the gut microbiota and the host are involved in co-metabolic interactions across multiple metabolic pathways. Therefore, conducting multi-omics studies on microbial metabolites, gut microbiota, and host clinical characteristics simultaneously may provide insights into the mechanisms of disease development. However, multi-omics studies that integrate metagenomics and metabolomics in GDM remain limited and are based on cross-sectional designs [31].
Previous studies have analyzed the differences in gut microbiota composition between GDM cases and controls [37,38]. However, these analyses were primarily based on 16S rRNA sequencing, which limited in-depth exploration at the species level and associations with metabolic profiles. Using metagenomics and untargeted Ultra-high Performance LC-MS analysis, we conducted a nested case–control study to examine associations between gut microbiota, serum metabolites in early pregnancy, and subsequent GDM. By jointly characterizing microbial taxa, circulating metabolites, and clinical indicators, this study aims to provide a more comprehensive descriptive view of biological patterns associated with GDM susceptibility.
2. Materials and Methods
2.1. Study Design and Participants
A nested case–control study was conducted within a cohort established at the Maternal and Child Health Hospital of Hunan Province, China (EC201624). The study was registered at the Chinese Clinical Trial Registry (ChiCTR; registration number ChiCTR1900020652). All participants provided written informed consent before completing an enrollment questionnaire as well as providing biological samples. The detailed methodology of this cohort has been comprehensively described in a previous publication [37]. All pregnant women underwent an OGTT between 24 and 28 weeks of gestation; GDM cases were defined as women whose blood glucose levels at fasting, 1 h, and 2 h after glucose ingestion were ≥5.1 mmol/L, ≥10.0 mmol/L, and ≥8.5 mmol/L, respectively.
Cases were selected according to the following criteria:
- (1)GDM was diagnosed at 24–28 weeks of gestation;
- (2)Completed questionnaire data and available blood and fecal samples;
- (3)No probiotics and antibiotics during pregnancy; and
- (4)No history of hypertension, thyroid disease, cardiovascular and cerebrovascular disorders during pregnancy.
Based on age (± 3 years) and gestational age (±1 week) at data collection and sample collection, controls with normal blood glucose throughout pregnancy were randomly selected for each case in a 1:2 ratio.
Participants were excluded if they met any of the following conditions:
- (1)Multiple pregnancies;
- (2)Inability to cooperate with the investigation due to mental illness or extreme emotional instability;
- (3)A child with any congenital disease.
2.2. Sample Size Calculation
This study involved 47 case and 94 controls, fulfilling the research requirements. Based on our previous results [37], the gut microbial α-diversity (Simpson index) in women with GDM was 0.945, whereas in women without GDM it was 0.965, with a standard deviation of approximately 0.03. Sample size calculations were performed using PASS (Power Analysis and Sample Size) software (version 25.0.3; NCSS, LLC, Kaysville, UT, USA). With two groups of 47 participants each, a two-sided equal-variance t-test at a significance level of α = 0.05 provides a statistical power of 90.4% to detect a mean difference of 0.02, assuming a common standard deviation of 0.03.
2.3. Data Collection
Basic information, including socio-demographic characteristics (age, ethnicity, educational level, employment, number of pregnancies, monthly household income, and health insurance status), lifestyles, disease history and health conditions, medication use, dietary intake, and supplement use were collected through a self-administered questionnaire completed by each participant at enrollment.
3–5 mL of EDTA-anticoagulated peripheral blood samples were collected from all study subjects by a nurse at enrollment. The blood samples were centrifuged at 4 °C for 15 min (4 °C, 3500× g rpm) within 12 h; the serum and blood cells were stored at −80 °C until processing. All pregnant women underwent an OGTT between 24 and 28 weeks of gestation. The laboratory test results and uncertain medication information were further confirmed through their Maternal and Child Health Manual and medical records.
2.4. Stool Sample Collection and Metagenomic Sequencing
Each woman was given a fecal sampler and provided detailed instructions for sample collection at enrollment. About 10 g fresh fecal samples were collected from each woman and placed immediately at −20 °C. All samples were transported to the laboratory within 12 h.
Fecal bacterial DNA was extracted using QIAamp Fast DNA Mini Kit (QIAGEN, Hilden, Germany) and stored at −80 °C. PCR amplification and sequencing were conducted using the Illumina platform. The amplified PCR products were purified. The purity of DNA from fecal samples was detected using an ultra-micro UV spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA), adhering to the principle that the DNA detection concentration should be higher than 10 μg/μL, with an optimal optical density (OD) value ranging between 1.8 and 2.0.
1 µg of high-quality DNA was processed to construct sequencing libraries, using the NEBNext^®^ Ultra™ DNA Library Prep Kit for Illumina (New England Biolabs, Ipswich, MA, USA) following the manufacturer’s instructions. Library quality and quantity were assessed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) with a high-sensitivity and wide dynamic range (0.1–50 ng/µL) to ensure that the libraries met Illumina platform quality standards. Absolute quantification of the library was performed using real-time quantitative PCR (qPCR, TaqMan probe method). Cluster generation on the flow cell was conducted using the Illumina cBot system (bridge PCR amplification), with paired-end sequencing libraries constructed using the TruSeq Nano DNA Library Prep Kit (Illumina, San Diego, CA, USA). High-throughput sequencing was ultimately performed on the Illumina HiSeq X Ten platform (2 × 150 bp read length mode), which achieves a single-run throughput of 1.6–1.8 Tb with ≥75% Q30 bases. The sequencing service was completed by Novogene Co., Ltd. (Beijing Novogene Bioinformatics Technology Co., Ltd., Beijing, China).
Raw sequencing reads underwent quality control using KneadData, including adapter removal and removal of host DNA, to generate high-quality clean reads. Taxonomic profiling was performed using MetaPhlAn2 through alignment to microbial reference genomes—including bacteria, archaea, eukaryotes, and viruses—for taxonomic composition and relative abundance annotation. Functional profiling was conducted with HUMAnN3 (v3.0), which mapped reads to pan-genomes and performed translated searches for unclassified reads, ultimately quantifying gene families and metabolic pathways.
2.5. Untargeted Metabolomic Assays
After thawing at −80 °C, serum samples underwent the following pretreatment procedures:
- (1)Addition of L-2-chlorophenylalanine internal standard solution (prepared in methanol) for quantification;
- (2)Protein precipitation using a methanol–acetonitrile mixture (2:1, v/v);
- (3)Acquisition of supernatant after ultrasonication, low-temperature incubation, and subsequent centrifugation;
- (4)Preparation of quality control (QC) samples in parallel to assess system stability.
For the liquid chromatography–mass spectrometry (LC–MS) analysis, a UPLC–QTOF–MS/MS system equipped with an Acquity RSL C120 C18 column (1.8 µm, 2.1 × 100 mm) was used. Data acquisition was performed in both ESI+ and ESI− modes. QC samples were used to monitor system stability, equilibrate the LC–MS system, and evaluate data quality for metabolite identification. The UPLC conditions included a C120 C18 column (100 × 2.1 mm), a column temperature maintained at 30 °C, and an injection volume of 5 µL.
The LC parameters for the positive ion mode were as follows: mobile phase A was 0.1% formic acid in water; mobile phase B was 0.1% formic acid in acetonitrile. The LC parameters for the negative ion mode were as follows: mobile phase A was 0.05% ammonia solution; mobile phase B was 0.05% formic acid in acetonitrile. Sample injection volume was 5 µL; elution gradient: 2% B for 0.0–2.0 min, increased to 90% B for 2.0–17.5 min, and returned to 2% B for 17.5–20.0 min, with a flow rate of 0.2 mL/min.
The MS analysis was performed under the following conditions: electrospray ionization (ESI) was used as the ion source, operating in dual ESI+ and ESI− detection modes. High-purity nitrogen served as the auxiliary gas for both spray ionization and desolvation processes. The instrument was set to full-scan mode covering a mass range of 20–1000 m/z, with the drying gas maintained at 200 °C and flowing at 0.2 mL/min.
2.6. Statistical Analysis
2.6.1. Metagenomic Sequencing Analysis
Alpha diversity was assessed using the Simpson and Shannon indices, with between-group comparisons conducted through Wilcoxon rank-sum tests. Beta diversity was quantified using Bray–Curtis dissimilarity (abundance-based) and Jaccard distance (presence/absence-based) metrics. Principal coordinate analysis (PCoA) was performed using the ade4 package (v1.7-22) in R (v4.4.1) to visualize sample distribution patterns, while hierarchical clustering via the unweighted pair group method with arithmetic mean (UPGMA) was implemented with the phangorn package (v2.11.1). All visualizations were generated using ggplot2 (v3.5.2) with consistent aesthetic parameters.
LEfSe analysis (LDA score > 4) was performed on the Galaxy platform to identify differential species between groups. The results were presented as an LEfSe (v1.0) clustering tree, LDA score plot, and feature table. Spearman correlation analysis was further conducted to evaluate associations among the identified differential species.
Functional annotation was carried out using the HUMAnN2 (v2.8.1) pipeline. Gene families were first annotated based on the UniRef90 database and then mapped to metabolic pathways via the MetaCyc(v27.0) database. The MinPath (v1.4) algorithm was subsequently applied to identify minimal pathway sets, generating functional pathway abundance profiles that comprehensively reflect the metabolic characteristics of microbial communities.
Functional distribution patterns across different groups were visualized using heatmaps generated with the pheatmap package (v1.0.12) in R (v4.4.1). LEfSe (v1.0) analysis (maintaining a consistent LDA score > 4) was implemented on functional gene data to identify group-specific differential metabolic pathways. The Benjamini–Hochberg false discovery rate (FDR) method was employed in differential abundance analysis of species and genera, functional gene and pathway analyses, and intergroup comparisons of KEGG and pathway abundances.
2.6.2. Non-Targeted Metabolomics Data Analysis
Raw data were imported into MetaboScape (version 3.0) for peak picking, noise filtering, and normalization procedures. Subsequent metabolite identification, including metabolite names, molecular formulas, retention times (RT), mass-to-charge ratios (m/z), and peak area values, was reported.
A supervised Orthogonal Partial Least Squares-Discriminant Analysis (OPLS-DA) model was applied as an exploratory multivariate approach to examine the overall metabolic patterns between groups using the ropls package (v1.30.0) in R (v4.4.1), and model robustness was assessed through 200 permutation tests to evaluate potential overfitting, with variable importance in projection (VIP) scores calculated as a supportive measure to rank metabolites contributing to group discrimination. Welch’s t-tests were conducted for between-group metabolite comparisons, followed by FDR adjustments. Differentially expressed metabolites were identified based on three criteria: |log_2_(fold change)| > 0.415 (equivalent to >1.33-fold change), BH-adjusted p-value < 0.05, and VIP score ≥ 1.
The identified differentially expressed metabolites were subjected to metabolic pathway enrichment analysis using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database (http://www.genome.jp/kegg/; accessed on 25 June 2024). Pathway significance was assessed via Fisher’s exact test (BH-adjusted p < 0.05). All statistical results were visualized using the R (version 4.4.1) programming environment, including volcano plots displaying metabolite change magnitude (fold change), statistical significance (–log_10_-transformed p-values), and VIP scores of significant metabolites.
To identify more potential correlation, a power-oriented statistical strategy was adopted, which means that spearman rank correlation analysis was conducted to examine the associations among differentially abundant microbiota, differentially expressed metabolites, and clinical indicators. An FDR-adjusted p-value < 0.05 was considered statistically significant.
3. Results
3.1. Characteristics of the Study Population
From 2018 onward, we enrolled 636 pregnant women from Hunan Maternal and Child Health Care Hospital who had provided demographic and socio-economic data, serum and fecal samples in the first trimester, and a completed oral glucose tolerance test (OGTT) during the second trimester. Based on the 2011 International Association of Diabetes and Pregnancy Study Groups (IADPSG) criteria, 94 women (15.41%) were diagnosed with GDM. 47 patients with GDM and 94 matched controls were randomly selected. The main characteristics in the first trimester, including lifestyle factors and clinical indicators of the 141 subjects, are summarized in Table 1 (the corresponding information for the 636 women is summarized in Supplementary Table S1). Age, height, body mass index (BMI), fasting glucose, and triglycerides were significantly different between GDM pregnant women and women without GDM, whereas characteristics such as ethnicity, education level, blood pressure, and total cholesterol were comparable between the two groups (p > 0.05). No participants reported smoking, but 21.4–23.6% of the women reported passive smoking. Only a small percentage of participants reported drinking, and the proportions of passive smoking and drinking did not differ between GDM pregnant women and women without GDM (p > 0.05). In particular, total energy intake and most food-group intakes did not differ significantly between the two groups.
3.2. Metagenomic Analysis of Gut Microbes
A total of 328 GB of raw sequencing data was generated, with an average sequencing depth of 7327.21 Mb per sample. The average data validity rate reached 99.73%. Through metagenomic species annotation, 424 gut microbiota species, classified into 146 genera, 60 families, 29 orders, 18 classes, and 10 phyla, were identified.
At the phylum level, the relative abundance of the top four dominant taxa in descending order was as follows: Firmicutes (68.96%), Proteobacteria (19.35%), Bacteroidetes (10.39%), and Actinobacteria (1.29%). The top four most abundant genera were as follows: Faecalibacterium (26.31%), Bacteroides (13.67%), Eubacterium (2.14%), and Roseburia (1.94%). At the species level, the dominant bacterial species were Ruminococcus bromii (65.78%) and Faecalibacterium prausnitzii (31.88%), while the relative abundance of remaining species each constituted less than 1%.
Compared to non-GDM controls, women with GDM showed significantly reduced gut microbial α-diversity, as measured by both the Simpson index (p < 0.05) and Shannon index (p < 0.05) (Figure 1a). PCoA based on Bray–Curtis and Jaccard distance metrics showed distinct clustering between the GDM and control groups, suggesting differences in overall community composition (Figure 1b).
Differential analysis of microbial community abundance at both genus and species levels was performed using LEfSe. At the genus level, four taxa showed significant differences between the case and control groups. Specifically, the genus Collinsella was enriched in the case group, while Peptostreptococcaceae, Oscillibacter, and Bacteroides were significantly more abundant in the control group (as shown in Figure 1c). At the species level, a total of 11 differentially abundant taxa were identified between the two groups. Collinsella aerofaciens and Clostridium bartlettii were enriched in the case group. In contrast, Anaerotruncus colihominis, Catenibacterium mitsuokai, Alistipes finegoldii, Bacteroidales bacterium ph8, Bacteroides xylanisolvens, Streptococcus salivarius, Lactobacillus casei paracasei, Alistipes putredinis, and Eubacterium eligens showed higher abundance in the control group (as shown in Figure 1d). The corresponding characteristic values and statistical test results are provided in Supplementary Table S2.
Functional annotation of the sequencing data was performed using HUMAnN2 and successfully identified 401 functional pathways. Subsequent LEfSe analysis identified 20 metabolic pathways exhibiting statistically significant differential abundance between the case and control groups, as shown in Figure 1e. Among them, four pathways were significantly enriched in the case group: D-galactarate degradation I, Superpathway of beta-D-glucuronide, D-glucuronate degradation, and D-galacturonate degradation and Fatty acid salvage (Supplementary Table S3).
Analysis of the correlation between 11 differentially abundant species at the species level and clinical indicators showed that Collinsella aerofaciens positively associated with first-trimester fasting glucose levels (r = 0.19, p = 0.04), as did Clostridium bartlettii (r = 0.22, p = 0.01). The correlation results between other differential bacterial species and additional clinical parameters are presented in Figure 1f.
3.3. Serum Metabolomic Profiles and Correlations
OA total of 381 metabolites were identified in both case and control groups by untargeted metabolomic analysis via UPLC-MS. OPLS-DA showed an apparent separation of metabolic patterns between GDM cases and controls (Figure 2a,b). The stability of the OPLS-DA model was assessed through 200 permutation tests, suggesting acceptable model performance (Figure 2c,d).
We identified 64 significantly differential metabolites between women with GDM and non-GDM controls using the following stringent criteria: (1) absolute log2 fold change > 0.415, (2) Benjamini–Hochberg adjusted p-value < 0.05, and (3) variable importance in projection (VIP) score ≥ 1. The detailed characteristics of the 64 metabolites are provided in Supplementary Table S4.
Among these differentially expressed metabolites, 23 metabolites were upregulated in GDM cases (Figure 2e). GDM cases showed elevated levels of Glycerol-3-phosphate, Glycerophosphocholine, Phenylalanine, Tyrosine, and 3,4-Dihydroxybutyric acid, while controls had higher levels of Cystine, Tryptophan, Niacinamide, Sorbitol-6-phosphate, Stearic acid, Trifluoroacetic acid, 3,7-Trimethyluric acid, Thioguanine, and 8-Hydroxy-2′-deoxyguanosine, etc.
By calculating the Spearman correlation coefficients, the relationship between 14 significant differential serum metabolites and clinical indicators in the first trimester was examined. The results of the correlation matrix indicated that Stearic acid was positively associated with first-trimester fasting glucose levels (r = 0.18, p = 0.04), and conversely, 3,7-Trimethyluric acid (r = −0.18, p = 0.04), Niacinamide (r = −0.24, p = 0.008), and Trifluoroacetic acid (r = −0.19, p = 0.03) were negatively associated with first-trimester fasting glucose levels (Figure 2f).
Correlation analysis between 11 significantly differential microbial taxa in the gut microbiota and 14 significant differential serum metabolites indicated that six differential species at the species level were associated with seven differential metabolites. Among these, Alistipes putredinis was positively associated with trifluoroacetic acid (r = 0.18, p = 0.04) and negatively associated with glycerol-3-phosphate (r = −0.22, p = 0.02). Collinsella aerofaciens was negatively related to Stearic acid (r = −0.23, p = 0.01) and Alistipes finegoldii was positively related to 3–7-trimethyluric acid (r = 0.22, p = 0.01). The correlations between microbial taxa and metabolites are presented in Figure 2g.
4. Discussion
4.1. Gut Microbiota Alterations Associated with GDM
Consistent with previous studies, this study observed significant differences in gut microbiota composition, including reduced alpha diversity and increased beta diversity in GDM women compared with non-GDM controls, through the metagenomic analyses. Firmicutes, Proteobacteria, and Bacteroidetes were the predominant phyla, consistent with our previous findings [37] but with slight variations likely attributable to methodological differences between 16S rRNA sequencing and whole metagenome sequencing.
At the species level, eleven differential gut microbiota were identified (all p < 0.05). Collinsella aerofaciens and Clostridium bartlettii were enriched in the GDM group, while nine taxa, including Alistipes putredinis and Bacteroidales bacterium ph8, were more abundant in controls. The genus Collinsella has been reported to be increased in GDM [11] and T2DM patients [39,40], and associated with inflammatory and metabolic phenotypes in prior observational studies [41]. The enrichment of Clostridium bartlettii in the GDM group and its positive correlation with fasting blood glucose levels suggested an association with altered glucose-related metabolic states. Previous work also associated Clostridium bartlettii with metformin response in diabetic patients [42], and other Clostridium species have been reported in GDM-related studies [11,13,43]. In contrast, Bacteroides was enriched in controls [9,11], and has been reported to be associated with fatty acid oxidation and broader aspects of lipid metabolism and insulin sensitivity [44,45]. Two species—Bacteroidales bacterium ph8 and Bacteroides xylanisolvens—were particularly abundant in the control group. Bacteroidales bacterium ph8 has been reported to be reduced in women with type 1 diabetes [46] and associated with gut-derived metabolites such as p-cresyl sulfate.
Several taxa identified in this study have shown inconsistent associations with glucose-related metabolic traits across previous reports. For example, Oscillibacter has been variably associated with glucose-related metabolic traits, with both reduced abundance reported in individuals with impaired glucose tolerance or prediabetes [9,47,48] and enrichment observed in women with GDM [49]. In our cohort, Oscillibacter was more abundant in non-GDM controls than in GDM cases, further highlighting the heterogeneous directions of association. Similarly, Alistipes spp. were negatively correlated with fasting glucose in our cohort, in line with previous reports of similar associations [22,28,31]. In contrast, other studies have reported an increased abundance of Alistipes in T2DM [41,50], suggesting its dual role as both protective and proinflammatory depending on metabolic context [51]. In addition, conflicting patterns have been reported for Eubacterium species in diabetes-related studies [39,52,53,54,55,56]. While some reported elevated Eubacterium in diabetes [39,52], our findings, consistent with Karlsson et al. [56], suggested enriched Eubacterium eligens in controls. Streptococcus also displayed higher abundance in controls, in line with Zheng et al. [57]. Taken together, the relationships among gut microbiota at the species level, host metabolism, and GDM are highly heterogeneous, warranting further investigation.
4.2. Metabolomic Alterations Associated with GDM
Untargeted metabolomic profiling revealed 64 differential metabolites, primarily involving amino acid and lipid metabolism pathways. Elevated phenylalanine and tyrosine in the GDM group align with previous studies [29]. In contrast, tryptophan levels were higher in controls, in line with earlier observational and mechanistic studies highlighting alterations in amino acid metabolism and redox homeostasis in metabolic disorders [58,59,60,61]. Tryptophan showed a negative correlation with triglyceride levels [62]. However, prior studies have reported inconsistent patterns for tryptophan-related metabolites in GDM, highlighting heterogeneity across study populations.
Among lipid-related metabolites, glycerol-3-phosphate and glycerophosphocholine were elevated in GDM cases. Nicotinamide was increased in controls and showed inverse correlations with glucose and LDL levels, consistent with prior findings in metabolic disease contexts [63,64,65]. Other metabolites, including sorbitol-6-phosphate, stearic acid, and purine intermediates, also showed differed between groups, although their biological interpretation remains uncertain and requires further validation.
4.3. Integration of Gut Microbiota and Metabolomic Findings
Our study revealed distinct microbial and metabolomic profiles in women with GDM compared with non-GDM controls through integrated metagenomic and metabolomic analyses. Correlation analyses between gut microbes, serum metabolites, and clinical indicators identified patterns of association across these data layers. Alistipes putredinis, enriched in controls, was negatively correlated with glycerol-3-phosphate and fasting glucose. Bacteroidales bacterium ph8 was positively correlated with tryptophan and inversely correlated with triglyceride levels. Together, these cross-domain associations provide a speculative model linking gut microbes, circulating metabolites, and metabolic traits in early pregnancy.
4.4. Future Directions
Our integrated analysis highlights the association patterns between gut microbiota and host metabolism in GDM. Future longitudinal and functional studies should further evaluate these microbial and metabolic signatures over the course of pregnancy and assess their potential causal relevance, beyond simply testing previously proposed speculative pathways. Understanding the temporal dynamics of changes in microbiota and metabolite profiles during pregnancy may help improve understanding of early metabolic alterations related to GDM.
5. Limitations
This study has several limitations. The sample size was estimated based on the intergroup differences in microbial diversity; however, this approach primarily addresses community-level variations and does not guarantee sufficient statistical power for detailed species-level comparisons or for detecting nuanced metabolomic correlations. Consequently, the robustness of species-specific conclusions may be limited. Additionally, the limited sample size with a relatively stringent LDA threshold increases the risk of overfitting and false positives in subsequent multiple correlation analyses. Given the observational nature of the study and limited sample size, the findings should be interpreted with appropriate caution.
Dietary intake was not adjusted in the analysis for several reasons: (1) the primary aim of this study was to identify potential correlations between microbiota, metabolites, and clinical indicators. Accordingly, a power-oriented statistical strategy was adopted, aiming to increase the sensitivity and facilitate the detection of correlations with moderate effect sizes. Moreover, baseline dietary intake showed no statistically significant differences between the GDM and control groups, and no significant associations were detected in either univariate or multivariate analyses. (2) The dietary patterns of pregnant women in Hunan were relatively homogeneous during the first trimester due to traditional habits, which may have minimized the dietary impact on gut microbiota composition. (3) Strict inclusion and exclusion criteria were applied, with all participants having no history of GDM and no recent (within one month) use of antibiotics or probiotics, ensuring comparability of gut microbiota-influencing factors between groups.
We also noted that obesity was unequally distributed between the two groups at baseline. Obesity undoubtedly affects metabolism, potentially introducing bias into the results. Based on the complex interactions among obesity, gut microbiota, and metabolism, we included waist circumference as a clinical factor in the correlation analysis to better elucidate the relationship between obesity and specific metabolites.
Although this study provides insufficient evidence regarding the mechanisms by which gut microbes influence the onset of GDM, it still offers an overview of gut microbiota and metabolites related to GDM, providing hypotheses for further mechanistic research.
6. Conclusions
Despite these limitations, this study provides valuable insights into the relationship between gut microbiota, serum metabolites, and GDM. Using a nested case–control design within a well-characterized cohort of pregnant women, this study suggested that alterations in gut microbiota during the first trimester are associated with later GDM status.
The multi-omics strategy—particularly the use of metagenomics—enhanced sequencing depth and enabled the identification of differential taxa at the species level. Furthermore, the integration of non-targeted metabolomics allowed for the identification of metabolic pathways potentially influenced by the gut microbiota. Together, these findings provide exploratory insights of the development of GDM and new clues for early prevention and intervention strategies.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Nguyen C.L. Pham N.M. Binns C.W. Duong D.V. Lee A.H. Prevalence of Gestational Diabetes Mellitus in Eastern and Southeastern Asia: A Systematic Review and Meta-Analysis J. Diabetes Res.20182018653697410.1155/2018/653697429675432 PMC 5838488 · doi ↗ · pubmed ↗
- 2Malaza N. Masete M. Adam S. Dias S. Nyawo T. Pheiffer C. A Systematic Review to Compare Adverse Pregnancy Outcomes in Women with Pregestational Diabetes and Gestational Diabetes Int. J. Environ. Res. Public Health 2022191084610.3390/ijerph 19171084636078559 PMC 9517767 · doi ↗ · pubmed ↗
- 3Kramer C.K. Campbell S. Retnakaran R. Gestational Diabetes and the Risk of Cardiovascular Disease in Women: A Systematic Review and Meta-Analysis Diabetologia 20196290591410.1007/s 00125-019-4840-230843102 · doi ↗ · pubmed ↗
- 4Tobias D.K. Hu F.B. Forman J.P. Chavarro J. Zhang C. Increased Risk of Hypertension after Gestational Diabetes Mellitus: Findings from a Large Prospective Cohort Study Diabetes Care 2011341582158410.2337/dc 11-026821593289 PMC 3120181 · doi ↗ · pubmed ↗
- 5Medici Dualib P. Ogassavara J. Mattar R. Mariko Koga da Silva E. Atala Dib S. de Almeida Pititto B. Gut Microbiota and Gestational Diabetes Mellitus: A Systematic Review Diabetes Res. Clin. Pract.202118010907810.1016/j.diabres.2021.10907834599971 · doi ↗ · pubmed ↗
- 6Hasain Z. Mokhtar N.M. Kamaruddin N.A. Mohamed Ismail N.A. Razalli N.H. Gnanou J.V. Raja Ali R.A. Gut Microbiota and Gestational Diabetes Mellitus: A Review of Host-Gut Microbiota Interactions and Their Therapeutic Potential Front. Cell. Infect. Microbiol.20201018810.3389/fcimb.2020.0018832500037 PMC 7243459 · doi ↗ · pubmed ↗
- 7Wang J. Zheng J. Shi W. Du N. Xu X. Zhang Y. Ji P. Zhang F. Jia Z. Wang Y. Dysbiosis of Maternal and Neonatal Microbiota Associated with Gestational Diabetes Mellitus Gut 2018671614162510.1136/gutjnl-2018-31598829760169 PMC 6109274 · doi ↗ · pubmed ↗
- 8Wang X. Liu H. Li Y. Huang S. Zhang L. Cao C. Baker P.N. Tong C. Zheng P. Qi H. Altered Gut Bacterial and Metabolic Signatures and Their Interaction in Gestational Diabetes Mellitus Gut Microbes 202012184076510.1080/19490976.2020.184076533222612 PMC 7714515 · doi ↗ · pubmed ↗
