A Demographic-Conditioned Variational Autoencoder for fMRI Distribution Sampling and Removal of Confounds
Anton Orlichenko, Gang Qu, Ziyu Zhou, Anqi Liu, Hong-Wen Deng,, Zhengming Ding, Julia M. Stephen, Tony W. Wilson, Vince D. Calhoun, and, Yu-Ping Wang

TL;DR
This paper introduces DemoVAE, a variational autoencoder model that removes demographic confounds from fMRI data and generates synthetic data conditioned on demographics, improving data sharing and analysis.
Contribution
DemoVAE is a novel demographic-conditioned VAE that effectively decorrelates fMRI features from demographics and generates high-quality synthetic fMRI data.
Findings
DemoVAE recapitulates group differences in fMRI data.
Most clinical and cognitive fields are not correlated with DemoVAE latents.
The model outperforms traditional VAE and GAN in capturing fMRI distribution.
Abstract
Objective: fMRI and derived measures such as functional connectivity (FC) have been used to predict brain age, general fluid intelligence, psychiatric disease status, and preclinical neurodegenerative disease. However, it is not always clear that all demographic confounds, such as age, sex, and race, have been removed from fMRI data. Additionally, many fMRI datasets are restricted to authorized researchers, making dissemination of these valuable data sources challenging. Methods: We create a variational autoencoder (VAE)-based model, DemoVAE, to decorrelate fMRI features from demographics and generate high-quality synthetic fMRI data based on user-supplied demographics. We train and validate our model using two large, widely used datasets, the Philadelphia Neurodevelopmental Cohort (PNC) and Bipolar and Schizophrenia Network for Intermediate Phenotypes (BSNIP). Results: We find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis
