# Addressing AI Bias in MRI‐Based Dementia Classification Among Hispanic Individuals

**Authors:** Ngoc‐Huynh Ho, Sokratis Charisis, Sachintha Ransara Brandigampala, Di Wang, Susan R. Heckbert, David M Martinez, Timothy M. Hughes, Nicolas Honnorat, Derek B. Archer, Timothy J. Hohman, Sudha Seshadri, Christos Davatzikos, Mohamad Habes

PMC · DOI: 10.1002/alz70856_105856 · Alzheimer's & Dementia · 2026-01-08

## TL;DR

This study shows that AI tools for dementia diagnosis using MRI scans work less well for Hispanic individuals compared to non-Hispanic Whites and suggests ways to reduce this bias.

## Contribution

The study evaluates bias mitigation techniques like Kernel Mean Matching and correlation removal to improve fairness in dementia classification for different ethnic groups.

## Key findings

- Kernel Mean Matching reduced accuracy disparities between non-Hispanic White and Hispanic populations by up to 6%.
- Correlation removal slightly improved model accuracy and reduced disparities to 4%.
- SHAP maps revealed population-specific brain regions important for dementia prediction, such as the hippocampus in non-Hispanic Whites and the occipital lobe in Hispanic participants.

## Abstract

MRI based dementia classification using diagnostic tools trained predominantly on non‐Hispanic White (NHW) participants often fail to generalize to other groups such as Hispanic, leading to biased outcomes. This study investigates disparities in dementia classification between NHW and Hispanic populations and evaluates bias mitigation techniques to improve diagnostic fairness.

MRI‐based features without harmonization were extracted using MUSE for 2,541 NHW and 249 Hispanic participants from the National Alzheimer's Coordinating Center for classifying normally cognitive and demented patients, with dementia prevalences of 35.97% and 45.78%, respectively. XGBoost classifiers were trained and assessed using balanced accuracy (BA) and fairness metrics, including false positive rate (FPR) and false negative rate (FNR) disparities. Bias mitigation techniques such as correlation removal (CR) and Kernel Mean Matching (KMM) were implemented. SHAP maps illustrated population‐specific brain regions contributing to dementia predictions.

KMM demonstrated the lowest disparity in BA between NHW and Hispanic populations (Figure 1), with differences of 1.5% and 6% when trained on Hispanic and NHW datasets, respectively. CR slightly enhanced overall model accuracy when trained on the combined dataset, achieving a 0.8% improvement over the baseline, and reduced disparities to 4%. KMM minimized FPR differences, achieving 0.8% when trained on NHW and 8.3% when trained on Hispanic data, and recorded the lowest FNR disparity of 11.2% for NHW training (Figure 2). CR achieved the smallest FPR differences when trained on all groups and the lowest FNR disparity when trained on Hispanic participants. SHAP visualizations (Figure 3) revealed that the model identified population‐specific structural brain changes related to dementia, such as hippocampal and temporal lobe importance in NHW and occipital lobe contributions in Hispanic participants. These findings emphasize the risk of biased predictions when models are trained on a single population and tested on another.

This study highlights the need for inclusive training datasets and robust bias mitigation strategies to address disparities in MRI based dementia classification across NHW and Hispanic populations. Techniques such as KMM and CR effectively reduced performance gaps, though balancing fairness with accuracy remains a challenge. The findings encourage robust strategies to enhance fairness in AI‐based healthcare diagnostics.

## Linked entities

- **Diseases:** dementia (MONDO:0001627)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12782832/full.md

---
Source: https://tomesphere.com/paper/PMC12782832