# Identification of secretory protein related biomarkers for primary biliary cholangitis based on machine learning and experimental validation

**Authors:** Zihao Xu, Yue Cai, Yifan Liu, Jun Xu, Sheng Guo, Lihan Zhou, Yang Ji, Lei Zhan, Liangbin Cheng

PMC · DOI: 10.1515/biol-2025-1192 · 2025-12-30

## TL;DR

This study identifies secretory proteins as potential biomarkers for early diagnosis of primary biliary cholangitis using machine learning and experimental validation.

## Contribution

The study introduces novel secretory protein biomarkers for PBC diagnosis, validated through machine learning and animal experiments.

## Key findings

- Machine learning identified 14-18 key diagnostic genes with an AUC of 0.96.
- Upregulated genes like CSF1R, PLCH2, and SLC38A1, and downregulated CST7 were highlighted as significant.
- Animal experiments confirmed the bioinformatics findings.

## Abstract

Primary biliary cholangitis (PBC) is a chronic autoimmune liver disease that causes bile duct damage, liver fibrosis, and cirrhosis, significantly affecting patients’ lives and healthcare costs. Early diagnosis is critical but is hindered by the limited sensitivity of existing biomarkers, particularly in patients who are negative for anti-mitochondrial antibodies. This limitation underscores the need for more reliable biomarkers. Our study focuses on secretory proteins as potential diagnostic biomarkers and aims to elucidate gene expression profiles associated with PBC using bioinformatics methods and machine learning. We identified 827 downregulated and 639 upregulated genes related to mitochondrial function and immune pathways. Additionally, Weighted Gene Co-expression Network Analysis revealed a blue module comprising 1,949 genes linked to PBC. Machine learning identified between 14 and 18 key diagnostic genes. Using a Gaussian Mixture Model, we achieved an area under the curve of 0.96, indicating excellent diagnostic performance. Notable genes included the upregulated CSF1R, PLCH2, and SLC38A1, as well as the downregulated CST7. Animal experiments further supported these bioinformatics findings. Our research highlights secretory proteins as promising biomarkers for the early diagnosis of PBC, with potential applications in developing precise diagnostic tools and personalized therapies. This work paves the way for future studies involving larger cohorts and multi-omics data.

## Linked entities

- **Genes:** CSF1R (colony stimulating factor 1 receptor) [NCBI Gene 1436], PLCH2 (phospholipase C eta 2) [NCBI Gene 9651], SLC38A1 (solute carrier family 38 member 1) [NCBI Gene 81539], CST7 (cystatin F) [NCBI Gene 8530]
- **Diseases:** Primary biliary cholangitis (MONDO:0005388), cirrhosis (MONDO:0005155)

## Full-text entities

- **Genes:** CST7 (cystatin F) [NCBI Gene 8530] {aka CMAP}, CSF1R (colony stimulating factor 1 receptor) [NCBI Gene 1436] {aka BANDDOS, C-FMS, CD115, CSF-1R, CSFR, FIM2}, PLCH2 (phospholipase C eta 2) [NCBI Gene 9651] {aka PLC-L4, PLC-eta2, PLCL4, PLCeta2}, SLC38A1 (solute carrier family 38 member 1) [NCBI Gene 81539] {aka ATA1, NAT2, SAT1, SNAT1}
- **Diseases:** cirrhosis (MESH:D005355), PBC (MESH:D008105), liver fibrosis (MESH:D008103), autoimmune liver disease (MESH:D008107), bile duct damage (MESH:D001649)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13014323/full.md

---
Source: https://tomesphere.com/paper/PMC13014323