# Causal Effects of Lifestyle and Dietary Factors on Rheumatoid Arthritis: An Integrated Analysis Combining Mendelian Randomization, Machine Learning, and Evaluation of Burden Dynamics and Health Inequality

**Authors:** Yan Gao, Guangxin Gu, Ruiwen Wang, Wenfeng Han, Bin Zheng, Aoxiang Yang, Ning Wang, Hailong Yu, Chen Jia, Yu Wang

PMC · DOI: 10.1002/fsn3.71584 · Food Science & Nutrition · 2026-02-22

## TL;DR

This study finds that lifestyle and dietary factors like obesity and smoking increase rheumatoid arthritis risk, while others like cheese intake may protect against it, and machine learning helps predict RA risk effectively.

## Contribution

The study integrates Mendelian randomization, machine learning, and global burden analysis to identify causal lifestyle and dietary links to rheumatoid arthritis and assess health inequalities.

## Key findings

- Obesity, smoking, and poultry intake increase RA risk, while pork and cheese intake are protective.
- Random forest machine learning achieved high RA risk prediction accuracy with age as the top predictor.
- Global RA burden is projected to rise through 2050 due to aging populations and persistent health disparities.

## Abstract

Rheumatoid arthritis (RA) is a chronic autoimmune disease contributing to global morbidity and disability. Despite its growing burden, causal links between lifestyle, dietary factors, and RA remain unclear. This study investigates causal links between lifestyle, dietary factors, and RA using Mendelian randomization (MR) and machine learning (ML). Two‐sample MR analyzed 42 lifestyle and dietary exposures, while an RA risk prediction model was developed using nationally representative data and nine ML algorithms. Global Burden of Disease data assessed health inequalities and RA burden trends through frontier, decomposition, and Bayesian Age‐Period‐Cohort analyses. MR identified obesity, current smoking, sleeplessness, poultry intake, and salt added to food as RA risk factors, while never smoking, pork consumption, and cheese intake were protective. Random forest showed superior predictive performance, with age as the most influential predictor; seven features exhibited nonlinear RA risk associations. RA burden revealed gender and regional disparities, with frontier analysis indicating potential for burden reduction in multiple countries. Between 1990 and 2021, global RA burden rose due to population growth and aging, with projections suggesting continued increases through 2050. These findings highlight the importance of targeted lifestyle and dietary interventions to reduce RA burden and address health inequities in high‐risk populations.

Using two‐sample Mendelian randomization, higher BMI, obesity class 2, current smoking, sleeplessness, poultry intake, and salt added to food were causally associated with increased rheumatoid arthritis (RA) risk, while never smoking, pork intake, and cheese intake showed protective associations. A machine‐learning framework identified seven key predictors and achieved robust discrimination (random forest AUROC = 0.908) with good calibration and net benefit. Global burden analyses revealed increasing RA prevalence and DALYs from 1990 to 2021 with persistent health inequalities, and projections suggest continued growth through 2050.

## Linked entities

- **Diseases:** rheumatoid arthritis (MONDO:0008383)

## Full-text entities

- **Genes:** CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, FOXP3 (forkhead box P3) [NCBI Gene 50943] {aka AIID, DIETER, IPEX, JM2, PIDX, XPID}, MFSD11 (major facilitator superfamily domain containing 11) [NCBI Gene 79157] {aka ET}
- **Diseases:** chronic pain (MESH:D059350), joint pain (MESH:D018771), MR (MESH:C562757), GBD (MESH:D001037), SII (MESH:C566784), polyarthritis (MESH:D001168), RA (MESH:D001172), joint deformities (MESH:D016916), autoimmune disease (MESH:D001327), obesity (MESH:D009765), insomnia (MESH:D007319), swelling (MESH:D004487), inflammation (MESH:D007249), sleep disorders (MESH:D012893), ASPR (MESH:C563626)
- **Chemicals:** vitamin B6 (MESH:D025101), alcohol (MESH:D000438), magnesium (MESH:D008274), folate (MESH:D005492), vitamin B12 (MESH:D014805), selenium (MESH:D012643), PIR (-), potassium (MESH:D011188), vitamin C (MESH:D001205), copper (MESH:D003300), prostaglandin E2 (MESH:D015232), vitamin A (MESH:D014801), vitamin E (MESH:D014810), vitamin D (MESH:D014807), salt (MESH:D012492), zinc (MESH:D015032)
- **Species:** Limosilactobacillus reuteri (species) [taxon 1598], Homo sapiens (human, species) [taxon 9606], Nicotiana tabacum (American tobacco, species) [taxon 4097], Lacticaseibacillus rhamnosus (species) [taxon 47715]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12927935/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12927935/full.md

## References

56 references — full list in the complete paper: https://tomesphere.com/paper/PMC12927935/full.md

---
Source: https://tomesphere.com/paper/PMC12927935