# AD diagnosis model based on fusion of heterogeneous brain imaging and genomic data

**Authors:** Zhihao Zhang, Ruixia Zhang, Wenzhong Yang, Ke lv, Miao Wu, Lianghui Xu

PMC · DOI: 10.3389/fnins.2026.1719390 · 2026-03-06

## TL;DR

This study improves early Alzheimer's detection by combining brain imaging and genetic data with a new matching strategy.

## Contribution

A gender-corrected random matching strategy for non-paired multi-modal data fusion in early AD screening.

## Key findings

- Multi-modal data outperformed single-modal data in predictive performance.
- Ensemble learning models showed stronger fitting capabilities on paired datasets.
- 16 genetic and 6 brain region volume features were identified as highly important.

## Abstract

Alzheimer's disease (AD) is a common neurodegenerative disorder in the elderly population, and early screening can effectively delay the progression of the disease. Mild cognitive impairment (MCI) occurs prior to the onset of AD; however, the accuracy of existing MCI-to-AD prediction methods remains relatively low. Additionally, small sample sizes and high feature dimensions often lead to model overfitting, highlighting the need for effective early screening approaches. To address the aforementioned issues, this study integrated non-paired multi-modal features—including clinical indicators from the ADNI database, blood biomarkers, brain region volume features extracted from MRI, and genetic biomarkers from the GEO database—and proposed a gender-corrected random matching strategy. The Random Forest algorithm was adopted to evaluate this strategy, analyze feature importance, and compare the performance of 9 machine learning algorithms based on the top 40 ranked features. The predictive performance of multi-modal data was superior to that of single-modal data, and the proposed strategy achieved favorable results in early AD screening. 16 specific genetic features (e.g., IFI27, EDF1, RAP2A, KIF5C, SERPINA3, FBXW7, IFITM1, ISG15, PSMB3, APOE4, KCNB1, PSPH, HMGN2, S100A13, IFIT3, and CALM1) and 6 brain region volume features ranked high in terms of importance. When validated using paired datasets from ADNI across the 9 algorithms, ensemble learning models demonstrated significantly stronger fitting capabilities. The non-paired multi-modal fusion approach not only expands the sample size but also enhances the generalization ability and robustness of the model. This provides a theoretical basis for the application of this strategy in the field of small-sample medical research.

## Linked entities

- **Genes:** IFI27 (interferon alpha inducible protein 27) [NCBI Gene 3429], EDF1 (endothelial differentiation related factor 1) [NCBI Gene 8721], RAP2A (RAP2A, member of RAS oncogene family) [NCBI Gene 5911], KIF5C (kinesin family member 5C) [NCBI Gene 3800], SERPINA3 (serpin family A member 3) [NCBI Gene 12], FBXW7 (F-box and WD repeat domain containing 7) [NCBI Gene 55294], IFITM1 (interferon induced transmembrane protein 1) [NCBI Gene 8519], ISG15 (ISG15 ubiquitin like modifier) [NCBI Gene 9636], PSMB3 (proteasome 20S subunit beta 3) [NCBI Gene 5691], APOE (apolipoprotein E) [NCBI Gene 348], KCNB1 (potassium voltage-gated channel subfamily B member 1) [NCBI Gene 3745], PSPH (phosphoserine phosphatase) [NCBI Gene 5723], HMGN2 (high mobility group nucleosomal binding domain 2) [NCBI Gene 3151], S100A13 (S100 calcium binding protein A13) [NCBI Gene 6284], IFIT3 (interferon induced protein with tetratricopeptide repeats 3) [NCBI Gene 3437], CALM1 (calmodulin 1) [NCBI Gene 801]
- **Diseases:** Alzheimer's disease (MONDO:0004975)

## Full-text entities

- **Genes:** FBXW7 (F-box and WD repeat domain containing 7) [NCBI Gene 55294] {aka AGO, CDC4, DEDHIL, FBW6, FBW7, FBX30}, RAP2A (RAP2A, member of RAS oncogene family) [NCBI Gene 5911] {aka K-REV, KREV, RAP2, RbBP-30}, HMGN2 (high mobility group nucleosomal binding domain 2) [NCBI Gene 3151] {aka HMG17}, KCNB1 (potassium voltage-gated channel subfamily B member 1) [NCBI Gene 3745] {aka DEE26, DRK1, Kv2.1}, APOE (apolipoprotein E) [NCBI Gene 348] {aka AD2, APO-E, ApoE4, LDLCQ5, LPG}, KIF5C (kinesin family member 5C) [NCBI Gene 3800] {aka CDCBM2, KINN, NKHC, NKHC-2, NKHC2}, IFIT3 (interferon induced protein with tetratricopeptide repeats 3) [NCBI Gene 3437] {aka CIG-49, GARG-49, IFI60, IFIT4, IRG2, ISG60}, CALM1 (calmodulin 1) [NCBI Gene 801] {aka CALML2, CAM2, CAM3, CAMB, CAMC, CAMI}, IFI27 (interferon alpha inducible protein 27) [NCBI Gene 3429] {aka FAM14D, ISG12, ISG12A, P27}, EDF1 (endothelial differentiation related factor 1) [NCBI Gene 8721] {aka CFAP280, EDF-1, MBF1}, ISG15 (ISG15 ubiquitin like modifier) [NCBI Gene 9636] {aka G1P2, IFI15, IMD38, IP17, UCRP, hUCRP}, IFITM1 (interferon induced transmembrane protein 1) [NCBI Gene 8519] {aka 9-27, CD225, DSPA2a, IFI17, LEU13}, PSMB3 (proteasome 20S subunit beta 3) [NCBI Gene 5691] {aka HC10-II}, S100A13 (S100 calcium binding protein A13) [NCBI Gene 6284], PSPH (phosphoserine phosphatase) [NCBI Gene 5723] {aka PSP, PSPHD}, SERPINA3 (serpin family A member 3) [NCBI Gene 12] {aka AACT, ACT, GIG24, GIG25}
- **Diseases:** AD (MESH:D000544), MCI (MESH:D060825), cognitive impairment (MESH:D003072), neurodegenerative disorder (MESH:D019636)

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13002799/full.md

---
Source: https://tomesphere.com/paper/PMC13002799