# Integrative machine learning and multi-omics framework identifies shared biomarkers for rheumatoid arthritis and ulcerative colitis

**Authors:** Meili Liu, Jun Ge, Lei Guo, Zhengzheng Wu, Zimo Cheng, Zenggen Liu, Yi Liu

PMC · DOI: 10.1371/journal.pone.0336243 · PLOS One · 2025-11-10

## TL;DR

A machine learning approach identifies shared biomarkers for rheumatoid arthritis and ulcerative colitis, linking them to macrophage-driven inflammation and VEGF signaling.

## Contribution

An integrative machine learning and multi-omics framework identifies shared biomarkers and immune infiltration patterns in rheumatoid arthritis and ulcerative colitis.

## Key findings

- 19 shared differentially expressed genes were identified, with IL-17 signaling highlighted as dysregulated.
- Four key biomarkers (DUOX2, IDO1, NPY1R, SELL) showed high diagnostic performance and were localized to pro-inflammatory macrophages.
- IDO1 and NPY1R expression changes were confirmed in RA-like and UC-like inflammation models using qRT-PCR.

## Abstract

Rheumatoid arthritis (RA) and ulcerative colitis (UC) are chronic inflammatory diseases with shared immune pathologies but limited common diagnostic biomarkers, which hinders the development of targeted therapies.

Public gene expression datasets were analyzed to identify differentially expressed genes (DEGs) common to both RA and UC. Functional enrichment and immune infiltration analyses revealed dysregulated pathways. A comprehensive machine learning framework that incorporated 12 algorithms and cross-validation was applied to screen for robust diagnostic biomarkers. Further, RA- and UC-related molecular subtypes were delineated, and the relationship between these shared biomarkers and immune infiltration characteristics was explored. Key findings were validated using single-cell RNA sequencing (scRNA-seq) of UC tissue to localize gene expression and qRT-PCR in cell models mimicking RA and UC.

Analysis identified 19 shared DEGs, with functional enrichment analysis highlighting IL-17 signaling. Machine learning prioritized four key biomarkers (DUOX2, IDO1, NPY1R, SELL) with high diagnostic performance. scRNA-seq localized these genes predominantly to a pro-inflammatory “Macrophage-High” subpopulation and revealed VEGF-mediated crosstalk with endothelial cells. qRT-PCR confirmed significant expression changes of IDO1 and NPY1R in both RA-like and UC-like inflammation models.

This integrative approach identifies DUOX2, IDO1, NPY1R, and SELL as shared RA-UC biomarkers potentially linked to macrophage-driven inflammation and VEGF signaling. These findings provide insights into the common pathogenesis and potential targets for dual-disease diagnostics and therapeutics.

## Linked entities

- **Genes:** DUOX2 (dual oxidase 2) [NCBI Gene 50506], IDO1 (indoleamine 2,3-dioxygenase 1) [NCBI Gene 3620], NPY1R (neuropeptide Y receptor Y1) [NCBI Gene 4886], SELL (selectin L) [NCBI Gene 6402]
- **Diseases:** rheumatoid arthritis (MONDO:0008383), ulcerative colitis (MONDO:0005101)

## Full-text entities

- **Genes:** NPY1R (neuropeptide Y receptor Y1) [NCBI Gene 4886] {aka NPY1-R, NPYR}, SELL (selectin L) [NCBI Gene 6402] {aka CD62L, LAM1, LECAM1, LEU8, LNHR, LSEL}, IDO1 (indoleamine 2,3-dioxygenase 1) [NCBI Gene 3620] {aka IDO, IDO-1, INDO}, VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}, DUOX2 (dual oxidase 2) [NCBI Gene 50506] {aka LNOX2, NOXEF2, P138-TOX, TDH6, THOX2}, IL17A (interleukin 17A) [NCBI Gene 3605] {aka CTLA-8, CTLA8, IL-17, IL-17A, IL17, ILA17}
- **Diseases:** UC (MESH:D003093), inflammation (MESH:D007249), RA (MESH:D001172)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12599921/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12599921/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12599921/full.md

---
Source: https://tomesphere.com/paper/PMC12599921