# Integrated bioinformatics and mendelian randomization reveal a six-gene diagnostic signature and key role of CYP26B1 in sarcopenia

**Authors:** Yaoqi Wu, Xiaoqing Cai, Shiwen Fan, Lina Zhao, Yingying Jiao, Tongkai Chen, Manting Liu, Yafang Song

PMC · DOI: 10.3389/fmolb.2026.1760938 · Frontiers in Molecular Biosciences · 2026-02-26

## TL;DR

This study identifies a six-gene signature for diagnosing sarcopenia and finds that CYP26B1 has a causal role in the disease, offering new diagnostic and therapeutic possibilities.

## Contribution

The study introduces a novel six-gene diagnostic model and establishes CYP26B1 as a causal gene in sarcopenia through integrated bioinformatics and Mendelian randomization.

## Key findings

- A six-gene diagnostic signature (FOXO1, ZBTB16, HOXB2, LYVE1, MGP, CYP26B1) was developed with high predictive accuracy (AUC >0.80).
- CYP26B1 was confirmed as a causal gene in sarcopenia via Mendelian randomization, linking retinoic acid metabolism to disease risk.
- qPCR validation confirmed the mRNA expression patterns observed in the Gene Expression Omnibus dataset.

## Abstract

The pathogenesis of sarcopenia involves complex molecular mechanisms, and treatment remains challenging, with a lack of reliable diagnostic biomarkers. The objective of this study is to identify biomarkers that may be linked to sarcopenia, examine how these biomarkers correlate with immune cell infiltration, and investigate the genes that exhibit a causal relationship with sarcopenia.

Four transcriptomic datasets were integrated to identify candidate biomarkers. Genes from the MEBrown module of weighted gene co-expression network analysis (WGCNA) analysis were cross-referenced with differentially expressed genes (DEGs). A diagnostic model was built using 113 machine learning algorithms, followed by protein-protein interaction (PPI) network analysis and SHapley Additive exPlanations (SHAP) evaluation. Immune cell quantification and correlation with sarcopenia-related genes were performed using CIBERSORT, while gene expression data was integrated with genome-wide association statistics (GWAS) and gene expression quantitative trait loci (eQTL) data. In vitro validation was carried out using C2C12 cells and quantitative polymerase chain reaction (qPCR) experiments.

We found 318 DEGs. By comparing the WGCNA gene with these DEGs, we found 109 possible biomarkers, which are related to immune regulation, muscle cytoskeleton regulation and retinol metabolism. A six-gene diagnostic signature (FOXO1, ZBTB16, HOXB2, LYVE1, MGP, and CYP26B1) was developed using machine learning and PPI network analysis, achieving high predictive accuracy (AUC >0.80), with HOXB2 identified as the top predictor via SHAP analysis. CIBERSORT analysis showed the relationship between these genes and immune cell subsets, while Mendelian randomization (MR) analysis confirmed the causal relationship between the expression of CYP26B1 gene and the risk of sarcopenia. The result of qPCR analysis is the same as the mRNA expression found in Gene Expression Omnibus (GEO) data set.

This study identified a highly reliable six-gene diagnostic signature for sarcopenia. Mendelian randomization established CYP26B1 as the sole causal factor, linking retinoic acid metabolism to disease etiology. This dual evidence provides a robust six-gene diagnostic model and a prioritized therapeutic target, elucidating immune-metabolic mechanisms of sarcopenia. These findings offer new avenues for early diagnosis and metabolism-based precision therapy.

## Linked entities

- **Genes:** FOXO1 (forkhead box O1) [NCBI Gene 2308], ZBTB16 (zinc finger and BTB domain containing 16) [NCBI Gene 7704], HOXB2 (homeobox B2) [NCBI Gene 3212], LYVE1 (lymphatic vessel endothelial hyaluronan receptor 1) [NCBI Gene 10894], MGP (matrix Gla protein) [NCBI Gene 4256], CYP26B1 (cytochrome P450 family 26 subfamily B member 1) [NCBI Gene 56603]

## Full-text entities

- **Genes:** Hoxb2 (homeobox B2) [NCBI Gene 103889] {aka Hox-2.8, Hoxbes2}, Mgp (matrix Gla protein) [NCBI Gene 17313] {aka Mglap}, Lyve1 (lymphatic vessel endothelial hyaluronan receptor 1) [NCBI Gene 114332] {aka 1200012G08Rik, Crsbp-1, Lyve-1, Xlkd1}, Zbtb16 (zinc finger and BTB domain containing 16) [NCBI Gene 235320] {aka PLZF, Zfp145, lu}, Foxo1 (forkhead box O1) [NCBI Gene 56458] {aka Afxh, FKHR, Fkhr1, Foxo1a}, Cyp26b1 (cytochrome P450, family 26, subfamily b, polypeptide 1) [NCBI Gene 232174] {aka CP26, P450RAI-2}
- **Diseases:** sarcopenia (MESH:D055948)
- **Chemicals:** retinoic acid (MESH:D014212)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12979158/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12979158/full.md

## References

98 references — full list in the complete paper: https://tomesphere.com/paper/PMC12979158/full.md

---
Source: https://tomesphere.com/paper/PMC12979158