# Multi-omics integration and machine learning reveal gut-immune signatures in idiopathic pulmonary fibrosis: insights from bulk RNA-seq, single-cell profiles, spatial transcriptomics, and experimental validation

**Authors:** Zhengyu Hu, Jiaqi Wang, Jialin Yu, Zheqing Hu, Jing Xue, Zhanbing Ma, Miaomiao Nian, Ruixin Qi, Tingting Zhao, Xia Cao, Hongxia Xin, Xiuyan Wang, Guilan Yang, Zhenzhen Gui, Xiaoming Liu, Juan Chen

PMC · DOI: 10.3389/fimmu.2026.1730289 · 2026-03-19

## TL;DR

This study uses multi-omics data and machine learning to identify key genes and immune patterns in idiopathic pulmonary fibrosis, revealing a gut-immune-lung connection and potential new treatments.

## Contribution

The study integrates diverse omics data and experimental validation to identify novel diagnostic signatures and therapeutic targets in IPF.

## Key findings

- CXCL13, IL33, TLR4, and IGF1 are core genes linked to immune infiltration and fibrotic remodeling in IPF.
- A four-gene model effectively distinguishes IPF from controls across multiple datasets.
- Mendelian randomization supports a causal gut-immune-lung axis in IPF pathogenesis.

## Abstract

Idiopathic pulmonary fibrosis (IPF) is a progressive, fatal lung disease with limited treatment options and a poor prognosis. Recent studies suggest a critical role for the gut–immune–lung axis in IPF, yet the underlying molecular mechanisms remain unclear.

The current study performed in silico multi-omics integration of publicly available datasets, including bulk RNA-seq, single-cell and spatial transcriptomics, as well as peripheral blood multi-omics data to uncover key molecular signatures in IPF. Furthermore, machine learning techniques were utilized to identify core genes, whereas functional analyses and Mendelian randomization were conducted to evaluate the causal relationships among gut microbiota, immune cells, and IPF. Additionally, experimental validation using qPCR and ELISA assays was conducted in vitro, in vivo, and in patient plasma to confirm the expression patterns of key genes.

Across integrated public bulk, single-cell, spatial, and blood multi-omics, CXCL13, IL33, TLR4, and IGF1 were identified as core IPF genes consistently linked to immune infiltration and fibrotic remodeling. Deconvolution, scRNA-seq, and spatial mapping localized their dysregulation to fibroblasts and immune compartments (notably B-cell, macrophage, and mast-cell axes), highlighting fibroblast–immune crosstalk in fibrotic foci. A four-gene model robustly distinguished IPF from controls across cohorts. Mendelian randomization supported a gut–immune–lung axis, indicating causal effects of specific gut taxa on IPF risk via immune phenotypes. qPCR/ELISA in TGF-β1–stimulated fibroblasts, bleomycin mouse lungs, and patient plasma corroborated upregulation of IL33, CXCL13, IGF1 and downregulation of TLR4. Drug-signature reversal nominated cucurbitacin I and temsirolimus; molecular docking was performed as a preliminary in silico, computer-simulation–based assessment of potential ligand–protein interactions between these compounds and the four core targets.

This study provides new insights into the importance of gut–immune–lung axis in IPF and identifies CXCL13, IL33, TLR4, and IGF1 as diagnostic signatures and therapeutic targets. By integrating public multi-omics resources with experimental validation, our findings offer a foundation for future diagnostic and treatment strategies aimed at modulating the gut microbiota and immune system in IPF.

## Linked entities

- **Genes:** CXCL13 (C-X-C motif chemokine ligand 13) [NCBI Gene 10563], IL33 (interleukin 33) [NCBI Gene 90865], TLR4 (toll like receptor 4) [NCBI Gene 7099], IGF1 (insulin like growth factor 1) [NCBI Gene 3479]
- **Chemicals:** cucurbitacin I (PubChem CID 5281321), temsirolimus (PubChem CID 6918289)
- **Diseases:** idiopathic pulmonary fibrosis (MONDO:0800029), IPF (MONDO:0800504)

## Full-text entities

- **Genes:** IGF1 (insulin like growth factor 1) [NCBI Gene 3479] {aka IGF, IGF-I, IGFI, MGF}, CXCL13 (C-X-C motif chemokine ligand 13) [NCBI Gene 10563] {aka ANGIE, ANGIE2, BCA-1, BCA1, BLC, BLR1L}, TLR4 (toll like receptor 4) [NCBI Gene 7099] {aka ARMD10, CD284, TLR-4, TOLL}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, IL33 (interleukin 33) [NCBI Gene 90865] {aka C9orf26, DVS27, IL1F11, NF-HEV, NFEHEV}
- **Diseases:** lung disease (MESH:D008171), IPF (MESH:D054990)
- **Chemicals:** cucurbitacin I (MESH:C038106), temsirolimus (MESH:C401859), bleomycin (MESH:D001761)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13043422/full.md

---
Source: https://tomesphere.com/paper/PMC13043422