# Application of a machine learning model based on routine clinical parameters for the diagnosis of rheumatoid arthritis with concomitant osteoporosis: a retrospective study

**Authors:** Zhe Wu, Zhaohui Li, Weifeng Li, Shengren Xiong

PMC · DOI: 10.7717/peerj.20852 · PeerJ · 2026-02-27

## TL;DR

This study uses machine learning with clinical data to predict osteoporosis in rheumatoid arthritis patients, offering a potential alternative to DEXA scans.

## Contribution

A novel machine learning approach using routine clinical parameters for diagnosing osteoporosis in rheumatoid arthritis patients is proposed.

## Key findings

- The support vector machine achieved the highest AUC (86.5%) for predicting osteoporosis in RA patients.
- Age, sex, and BMI were identified as major contributors to model predictions.
- Inflammatory markers like PLR, NPAR, WBC, and NLR showed modest influence on predictions.

## Abstract

Rheumatoid arthritis (RA) is commonly complicated by secondary osteoporosis (OP), affecting up to 80% of patients. Although dual-energy X-ray absorptiometry (DEXA) is the diagnostic gold standard, its limited accessibility highlights the need for alternative tools. In this retrospective cohort study of 396 hospitalized RA patients, we developed machine learning models using demographic and routine laboratory data to identify concomitant OP. Five classifiers were evaluated and combined via a soft-voting ensemble. The support vector machine achieved the highest area under the curve (AUC) (86.5%), while the random forest showed the highest accuracy (81.5%). The ensemble model demonstrated balanced performance (AUC 83.2%; accuracy 81.1%). SHapley Additive exPlanations (SHAP) analysis indicated age, sex, and body mass index (BMI) as major contributors, whereas albumin and inflammatory markers—including platelet-to-lymphocyte ratio (PLR), neutrophil percentage/albumin ratio (NPAR), white blood cell count (WBC), and neutrophil-to-lymphocyte ratio (NLR)—showed modest but heterogeneous influences on model predictions. These findings suggest that machine learning models incorporating routinely collected clinical data offer a practical and interpretable approach for preliminary OP risk assessment in RA. However, given the single-center design and limited sample size, the results should be considered exploratory, and larger external validation studies are warranted.

## Linked entities

- **Diseases:** rheumatoid arthritis (MONDO:0008383), osteoporosis (MONDO:0005298)

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** Inflammation (MESH:D007249), NLR (MESH:D015467), Fracture (MESH:D050723), malignancy (MESH:D009369), osteoporotic fracture (MESH:D058866), obesity (MESH:D009765), autoimmune disease (MESH:D001327), muscle mass loss (MESH:C536030), nutritional disturbances (MESH:D009748), RA (MESH:D001172), synovitis (MESH:D013585), bone loss (MESH:D001847), OP (MESH:D010024), rheumatic conditions (MESH:D012216), systemic (MESH:D015619), immune dysregulation (OMIM:614878)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12951883/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12951883/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12951883/full.md

---
Source: https://tomesphere.com/paper/PMC12951883