# Integrated bioinformatics and machine learning to explore the common mechanisms and potential biomarkers between periodontitis and preterm birth

**Authors:** Feng Mei, Yutong Liu, Wenting Xu, Ruoyun Liu, Xinlin Wang, Tingting Li, Yi Chen, Tingting Wu, Wei Zhang

PMC · DOI: 10.3389/fcell.2026.1763374 · Frontiers in Cell and Developmental Biology · 2026-02-20

## TL;DR

This study explores the link between periodontitis and preterm birth, identifying CD53 and BIN2 as potential shared biomarkers and suggesting immune response as a common mechanism.

## Contribution

The study introduces CD53 and BIN2 as novel potential biomarkers for periodontitis and preterm birth through integrated bioinformatics and machine learning analysis.

## Key findings

- CD53 and BIN2 are upregulated in both periodontitis and preterm birth and show strong diagnostic performance (AUC > 0.7).
- Common differentially expressed genes are enriched in immune-related pathways, suggesting immune response as a shared mechanism.
- CD53 and BIN2 exhibit high connectivity in the protein-protein interaction network and are upregulated in a periodontitis animal model.

## Abstract

There is accumulating evidence suggesting an association between periodontitis (PD) and preterm birth (PTB), but the underlying mechanisms have not been fully elucidated. This study aims to explore potential biomarkers and mechanisms between PD and PTB through integrated bioinformatics and machine learning approaches.

Datasets for PD (GSE16134 and GSE10334) and PTB (GSE203507, GSE174415, GSE18809, GSE73685 and GSE120480) were acquired from Gene Expression Omnibus (GEO). Then we performed Weighted gene co-expression network analysis (WGCNA), differential expressed genes (DEGs) analysis and three machine learning algorithms to identify cross-talk genes. To evaluate the potential of cross-talk genes as diagnostic biomarkers for PD and PTB, receiver operating characteristic (ROC) curve analysis and expression analysis were conducted. We then conducted functional enrichment analysis to elucidate the biological roles of the common DEGs. Single-sample gene set enrichment analysis (ssGSEA) assessed immune cell patterns of PD and PTB and biomarker-immune cell correlations. Additionally, we constructed a protein-protein interaction (PPI) network and further analyzed potential biomarkers using the cytoHubba plugin in Cytoscape software. Ultimately, the expression of the core genes in the PD animal model were validated.

We identified four cross-talk genes through the integrated analysis. Common DEGs were mainly concentrated in immune-related pathways. Following expression analysis and ROC curve analysis, we identified two genes (CD53 and BIN2) as potential biomarkers for PD and PTB. These genes were upregulated in disease groups compared to controls and exhibited strong diagnostic performance (AUC > 0.7) in both the training and validation cohorts. Moreover, CD53 and BIN2 displayed high connectivity within the PPI network. Immune cell infiltration analysis revealed that multiple immune cell types exhibited consistent upregulation in both diseases. In the PD model, consistent upregulation of CD53 and BIN2 was observed in the maxillary bone.

We identified two potential biomarkers (CD53 and BIN2) for the concurrent diagnosis of PD and PTB, and suggested that the potential common mechanism of these two diseases may be correlated with the immune response. This study provides novel insights into the pathogenesis of both diseases, thereby informing future preventive, diagnostic and therapeutic strategies.

## Linked entities

- **Genes:** CD53 (CD53 molecule) [NCBI Gene 963], BIN2 (bridging integrator 2) [NCBI Gene 51411]
- **Diseases:** periodontitis (MONDO:0005076)

## Full-text entities

- **Genes:** IFNG (interferon gamma) [NCBI Gene 3458] {aka IFG, IFI, IMD69}, BIN2 (bridging integrator 2) [NCBI Gene 51411] {aka BRAP-1}, GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, Cd53 (CD53 antigen) [NCBI Gene 12508] {aka Ox-44, Tspan25}, FCER1G (Fc epsilon receptor Ig) [NCBI Gene 2207] {aka FCRG}, IL2 (interleukin 2) [NCBI Gene 3558] {aka IL-2, TCGF, lymphokine}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, CD53 (CD53 molecule) [NCBI Gene 963] {aka MOX44, TSPAN25}, Bin2 (bridging integrator 2) [NCBI Gene 668218], IL17A (interleukin 17A) [NCBI Gene 3605] {aka CTLA-8, CTLA8, IL-17, IL-17A, IL17, ILA17}, POU2F3 (POU class 2 homeobox 3) [NCBI Gene 25833] {aka Epoc-1, OCT-11, OCT11, OTF-11, PLA-1, PLA1}, PTBP1 (polypyrimidine tract binding protein 1) [NCBI Gene 5725] {aka HNRNP-I, HNRNPI, HNRPI, PTB, PTB-1, PTB-T}, POTEF (POTE ankyrin domain family member F) [NCBI Gene 728378] {aka A26C1B, POTE2alpha, POTEACTIN}, NCAM1 (neural cell adhesion molecule 1) [NCBI Gene 4684] {aka CD56, MSK39, NCAM}, TNF (tumor necrosis factor) [NCBI Gene 7124] {aka DIF, IMD127, TNF-alpha, TNFA, TNFSF2, TNLG1F}, IL10 (interleukin 10) [NCBI Gene 3586] {aka CSIF, GVHDS, IL-10, IL10A, TGIF}
- **Diseases:** maternal (MESH:D000079262), immune dysregulation (OMIM:614878), chronic (MESH:D002908), pregnancy outcomes (MESH:D011254), necrotizing enterocolitis (MESH:D020345), systemic (MESH:D015619), infection (MESH:D007239), cardiovascular and metabolic diseases (MESH:D002318), PTB (MESH:D047928), MF (MESH:C567116), APOs (MESH:D011248), atherosclerosis (MESH:D050197), intracerebral hemorrhage (MESH:D002543), death (MESH:D003643), respiratory distress syndrome (MESH:D012128), bleeding (MESH:D006470), diabetes (MESH:D003920), PE (MESH:D011225), fetal growth restriction (MESH:D005317), tooth loss (MESH:D016388), PD (MESH:D010518), inflammation (MESH:D007249), diseases (MESH:D004194), neurodegenerative disorders (MESH:D019636)
- **Chemicals:** PVDF (MESH:C024865), lipid (MESH:D008055), TRIzol Reagent (-), sodium dodecyl sulfate (MESH:D012967), 8-OHdG (MESH:D000080242), nitrogen (MESH:D009584), sodium pentobarbital (MESH:D010424)
- **Species:** Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** C57BL/6 — Mus musculus (Mouse), Transformed cell line (CVCL_C0MU)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12963270/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12963270/full.md

## References

67 references — full list in the complete paper: https://tomesphere.com/paper/PMC12963270/full.md

---
Source: https://tomesphere.com/paper/PMC12963270