Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning   and Diffusion Priors

Paul S. Scotti; Atmadeep Banerjee; Jimmie Goode; Stepan Shabalin; Alex; Nguyen; Ethan Cohen; Aidan J. Dempster; Nathalie Verlinde; Elad Yundler,; David Weisberg; Kenneth A. Norman; Tanishq Mathew Abraham

arXiv:2305.18274·cs.CV·October 10, 2023·40 cites

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors

Paul S. Scotti, Atmadeep Banerjee, Jimmie Goode, Stepan Shabalin, Alex, Nguyen, Ethan Cohen, Aidan J. Dempster, Nathalie Verlinde, Elad Yundler,, David Weisberg, Kenneth A. Norman, Tanishq Mathew Abraham

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

MindEye is a novel approach that uses contrastive learning and diffusion priors to accurately reconstruct and retrieve images from fMRI brain activity, achieving state-of-the-art results.

Contribution

It introduces a dual-module system for fMRI-to-image mapping, combining retrieval and reconstruction, with improved training techniques and larger models for enhanced performance.

Findings

01

Achieves state-of-the-art in image reconstruction and retrieval from fMRI data.

02

Can accurately retrieve images from large-scale databases like LAION-5B.

03

Better preserves low-level image features using img2img with autoencoders.

Abstract

We present MindEye, a novel fMRI-to-image approach to retrieve and reconstruct viewed images from brain activity. Our model comprises two parallel submodules that are specialized for retrieval (using contrastive learning) and reconstruction (using a diffusion prior). MindEye can map fMRI brain activity to any high dimensional multimodal latent space, like CLIP image space, enabling image reconstruction using generative models that accept embeddings from this latent space. We comprehensively compare our approach with other existing methods, using both qualitative side-by-side comparisons and quantitative evaluations, and show that MindEye achieves state-of-the-art performance in both reconstruction and retrieval tasks. In particular, MindEye can retrieve the exact original image even among highly similar candidates indicating that its brain embeddings retain fine-grained image-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

medarc-ai/fmri-reconstruction-nsd
pytorchOfficial

Datasets

AshwinKM2005/fmri-fm
dataset· 166 dl
166 dl

Videos

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Medical Image Segmentation Techniques · Medical Imaging Techniques and Applications

MethodsDiffusion · Contrastive Language-Image Pre-training