# A predictive model for cognitive decline using social determinants of health

**Authors:** Yingnan He, Yu Leng, Ana-Maria Vranceanu, Christine S. Ritchie, Deborah Blacker, Sudeshna Das

PMC · DOI: 10.1016/j.jarlif.2025.100056 · JAR Life · 2026-01-06

## TL;DR

This paper develops a machine learning model to predict cognitive decline using social and lifestyle factors, showing promise for low-resource settings.

## Contribution

The novel contribution is a predictive model for cognitive decline using social determinants of health data, with a focus on fairness and interpretability.

## Key findings

- The model achieved an RMSE of 39.25, representing 10.2% of the cognitive score range.
- Key predictors included education level, age, and social activity frequency.
- Fairness analyses revealed biases in underrepresented subgroups, such as those with 7–9 years of education.

## Abstract

Early diagnosis of Alzheimer’s disease and related dementias (AD/ADRD) is critical but often constrained by limited access to fluid and imaging biomarkers, particularly in low-resource settings.

To develop and evaluate a predictive model for cognitive decline using survey-based data, with attention to model interpretability and fairness.

Using data from the Mexican Health and Aging Study (MHAS), a nationally representative longitudinal survey of adults aged 50 and older (N = 4095), we developed a machine learning model to predict future cognitive scores. The model was trained on survey data from 2003 to 2012, encompassing demographic, lifestyle, and social determinants of health (SDoH) variables. A stacked ensemble approach combined five base models—Random Forest, LightGBM, XGBoost, Lasso, and K-Nearest Neighbors—with a Ridge regression meta-model.

The model achieved a root-mean-square error (RMSE) of 39.25 (95 % CI: 38.12–40.52), representing 10.2 % of the cognitive score range, on a 20 % held-out test set. Features influencing predictions, included education level, age, reading behavior, floor material, mother’s education level, social activity frequency, the interaction between the number of living children and age, and overall engagement in activities. Fairness analyses revealed model biases in underrepresented subgroups within the dataset, such as individuals with 7–9 years of education.

These findings highlight the potential of using accessible, low-cost SDoH survey data for predicting risk of cognitive decline in aging populations. They also underscore the importance of incorporating fairness metrics into predictive modeling pipelines to ensure equitable performance across diverse groups.

## Linked entities

- **Diseases:** Alzheimer’s disease (MONDO:0004975)

## Full-text entities

- **Diseases:** dementias (MESH:D003704), cognitive decline (MESH:D003072), AD (MESH:D000544)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12809124/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12809124/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12809124/full.md

---
Source: https://tomesphere.com/paper/PMC12809124