# Predicting inter-microbial host specificity in oral biofilms using a lightweight relation-aware knowledge graph model

**Authors:** Prabhu Manickam Natarajan, Sudhir Rama Varma, Jayaraj Kodangattil Narayanan, Ruba Odeh

PMC · DOI: 10.3389/fcimb.2026.1775191 · Frontiers in Cellular and Infection Microbiology · 2026-02-20

## TL;DR

This paper introduces a new graph-based model to predict how viruses and bacteria interact in the human mouth, improving understanding of oral diseases.

## Contribution

A lightweight, relation-aware knowledge graph model called IK-BRNet is introduced for predicting microbial host specificity in oral biofilms.

## Key findings

- IK-BRNet outperformed a conventional GAT model with a higher AUC-ROC score of 0.929.
- The model improved sensitivity for disease-associated viral taxa, reaching 93.8% compared to 56.3% in the baseline.
- Site-specific predictions showed higher disease scores for plaque-associated viruses and lower scores in healthy niches.

## Abstract

The human oral cavity hosts a complex microbial ecosystem of bacteria, viruses, bacteriophages, and other microorganisms forming biofilms in different niches. Phage–bacteria host specificity is crucial in shaping microbial community, stability, and dysbiosis. mapping this specificity is limited by experimental constraints and traditional methods can’t capture ecological complexity. The goal is to create a graph-based model that treats inter-microbial host specificity as a relational learning problem, integrating taxonomic, ecological, and infection data into a knowledge graph. This improves phage–bacteria host predictions and reveals microbial hubs and interaction patterns related to periodontal disease dysbiosis.

This study introduces a lightweight, relation-aware knowledge graph for predicting microbial host specificity in oral biofilms. We built a heterogeneous graph of the oral microbiome, incorporating microbial taxa, anatomical sites, taxonomic hierarchies, enrichment patterns, and INFECTS relationships. The dataset includes 500 viral taxa across four oral niches, with 21,338 significant co-occurrence relationships and various biological features. To learn meaningful representations, we combined graph embeddings with microbial features. We developed a relation-aware graph neural network, IK-BRNet, to efficiently encode ecological and interaction semantics.

Model performance was evaluated against a conventional Graph Attention Network (GAT) using stratified training, validation, and test splits with class imbalance correction. IK-BRNet demonstrated faster convergence and superior discrimination ability, achieving a higher AUC-ROC (0.929 vs. 0.904) and markedly improved sensitivity for disease-associated viral taxa (93.8% vs. 56.3%). While the baseline GAT achieved higher accuracy and specificity, IK-BRNet consistently reduced false negatives, thereby improving its ability to detect disease-related microbial signals. Site-specific predictions confirmed biological validity, with the highest disease scores for dental plaque–associated viruses and lower scores in healthy niches such as the tongue and buccal mucosa.

This study shows that relation-aware graph learning offers a meaningful and efficient way to model inter-microbial host specificity in oral biofilms. The framework improves oral microbiome network inference and supports disease screening, ecological analysis, and microbiome-based dentistry.

## Linked entities

- **Diseases:** periodontal disease (MONDO:0002635)

## Full-text entities

- **Diseases:** INFECTS (MESH:D007239), PN (MESH:C565820), Periodontal diseases (MESH:D010510), diabetics (MESH:D003920), dysbiosis (MESH:D064806), disease (MESH:D004194), periodontal health (MESH:D010518), inflammation (MESH:D007249), microbial disease (MESH:D015163)
- **Species:** Homo sapiens (human, species) [taxon 9606], Fusobacterium (genus) [taxon 848], Porphyromonas gingivalis (species) [taxon 837], Bacteriophage sp. (species) [taxon 38018]
- **Cell lines:** MI-RGC — Mus musculus (Mouse), Transformed cell line (CVCL_4059)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12963330/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12963330/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12963330/full.md

---
Source: https://tomesphere.com/paper/PMC12963330