Pretraining Transformer-Based Models on Diffusion-Generated Synthetic Graphs for Alzheimer's Disease Prediction
Abolfazl Moslemi, Hossein Peyvandi

TL;DR
This paper introduces a Transformer-based framework that leverages diffusion-generated synthetic graphs for Alzheimer's disease prediction, improving model performance in low-data, imbalanced clinical scenarios.
Contribution
It combines diffusion-based synthetic data generation with graph transformer encoders and transfer learning to enhance Alzheimer's diagnosis accuracy.
Findings
Outperforms baseline models in AUC, accuracy, sensitivity, and specificity.
Synthetic data improves model generalization in low-sample settings.
Distributional alignment metrics confirm synthetic data quality.
Abstract
Early and accurate detection of Alzheimer's disease (AD) is crucial for enabling timely intervention and improving outcomes. However, developing reliable machine learning (ML) models for AD diagnosis is challenging due to limited labeled data, multi-site heterogeneity, and class imbalance. We propose a Transformer-based diagnostic framework that combines diffusion-based synthetic data generation with graph representation learning and transfer learning. A class-conditional denoising diffusion probabilistic model (DDPM) is trained on the real-world NACC dataset to generate a large synthetic cohort that mirrors multimodal clinical and neuroimaging feature distributions while balancing diagnostic classes. Modality-specific Graph Transformer encoders are first pretrained on this synthetic data to learn robust, class-discriminative representations and are then frozen while a neural classifier…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Dementia and Cognitive Impairment Research · Bayesian Methods and Mixture Models
