# Harmonizing neuropsychological test data across prospective studies

**Authors:** Rosita Shishegar, James D. Doecke, Yen Ying Lim, Pierrick Bourgeat, Vincent Dore, Bhargav Tallapragada, Simon M. Laws, Tenielle Porter, Samantha Burnham, Azadeh Feizpour, Ashley Gillman, Michael Weiner, Jason Hassenstab, Christopher C. Rowe, Victor L. Villemagne, Colin L. Masters, Jurgen Fripp, Hamid Sohrabi, Paul Maruff

PMC · DOI: 10.1002/alz.71186 · 2026-02-14

## TL;DR

This study combines data from three Alzheimer's research studies to improve understanding of the disease by harmonizing cognitive test scores.

## Contribution

A validated method using MissForest imputation to harmonize cognitive test data across Alzheimer's disease studies.

## Key findings

- MissForest imputation showed high accuracy with mean absolute errors ≤ test–retest variability in most cases.
- Composite cognitive scores reflected known disease patterns and showed significant stratification across clinical–pathological groups.
- Digit Symbol Coding and Trail-Making Test Part B had slightly higher MAEs but still within individual variation.

## Abstract

Alzheimer's disease (AD) research relies on large datasets and advanced statistical models. However, individual population studies often lack sufficient sample size for conclusive results. Harmonizing cognitive test data across studies can address this gap, despite differences in testing protocols. This study harmonizes cognitive data from three major AD cohorts to support robust clinical–pathological modelling.

Information from the Alzheimer's Disease Neuroimaging Initiative (N = 1446); Australian Imaging, Biomarkers and Lifestyle (N = 1764); and Open Access Series of Imaging Studies‐3 (N = 440) were integrated, including cognitive scores, demographics, genetics, and clinical and neuroimaging data. Neuropsychological tests relevant to AD were harmonized using MissForest, a machine learning–based imputation method. Validation involved assessing imputation accuracy and analyzing composite cognitive scores across clinical–pathological groups.

Imputation showed high accuracy (mean absolute error ≤ test–retest variability in cognitively unimpaired participants). Composite scores reflected known disease patterns with significant stratification across clinical–pathological groups.

The validated harmonization approach demonstrated reliable imputation, enabling more powerful AD models and supporting future diagnostic and therapeutic advances.

MissForest imputation reliably harmonized cognitive test scores across Alzheimer's disease (AD) studies.Harmonized dataset included 1446 Alzheimer's Disease Neuroimaging Initiative; 1764 Australian Imaging, Biomarkers and Lifestyle; and 440 Open Access Series of Imaging Studies‐3 subjects.Mean absolute errors (MAEs) ≤ test–retest variability in cognitively unimpaired participants for most tests.Digit Symbol Coding and Trail‐Making Test Part B MAEs slightly > test—retest, < individual variation.Composite scores had lower baseline and faster decline with increasing AD severity.

MissForest imputation reliably harmonized cognitive test scores across Alzheimer's disease (AD) studies.

Harmonized dataset included 1446 Alzheimer's Disease Neuroimaging Initiative; 1764 Australian Imaging, Biomarkers and Lifestyle; and 440 Open Access Series of Imaging Studies‐3 subjects.

Mean absolute errors (MAEs) ≤ test–retest variability in cognitively unimpaired participants for most tests.

Digit Symbol Coding and Trail‐Making Test Part B MAEs slightly > test—retest, < individual variation.

Composite scores had lower baseline and faster decline with increasing AD severity.

## Linked entities

- **Diseases:** Alzheimer's disease (MONDO:0004975)

## Full-text entities

- **Genes:** FAS (Fas cell surface death receptor) [NCBI Gene 355] {aka ALPS1A, APO-1, APT1, CD95, FAS1, FASTM}, APP (amyloid beta precursor protein) [NCBI Gene 351] {aka AAA, ABETA, ABPP, AD1, APPI, CTFgamma}, APOE (apolipoprotein E) [NCBI Gene 348] {aka AD2, APO-E, ApoE4, LDLCQ5, LPG}, MAPT (microtubule associated protein tau) [NCBI Gene 4137] {aka DDPAC, FTD1, FTDP-17, MAPTL, MSTD, MTBT1}
- **Diseases:** amyloid (MESH:C000718787), Dementia (MESH:D003704), frontotemporal dementia (MESH:D057180), AIBL (MESH:C564543), CU (MESH:D003072), disease (MESH:D004194), RESEARCH (MESH:D014947), Parkinson disease (MESH:D010300), MCI (MESH:D060825), ADOPIC (MESH:D000544), vascular disease (MESH:D014652), cognitive symptoms (MESH:D019954)
- **Chemicals:** TMT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12906370/full.md

---
Source: https://tomesphere.com/paper/PMC12906370