MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding
Inhwa Han, Jaayeon Lee, Jong Chul Ye

TL;DR
MindFormer is a novel model that aligns multi-subject fMRI signals to improve brain decoding, enabling more accurate fMRI-to-image and fMRI-to-text generation across individuals.
Contribution
The paper introduces MindFormer, a semantic alignment method with subject-specific tokens and a new embedding scheme, advancing multi-subject fMRI decoding capabilities.
Findings
Generates semantically consistent images and texts across subjects
Surpasses existing models in multi-subject brain decoding accuracy
Maintains high semantic fidelity across diverse subjects
Abstract
Research efforts for visual decoding from fMRI signals have attracted considerable attention in research community. Still multi-subject fMRI decoding with one model has been considered intractable due to the drastic variations in fMRI signals between subjects and even within the same subject across different trials. To address current limitations in multi-subject brain decoding, here we introduce a novel semantic alignment method of multi-subject fMRI signals using so-called MindFormer. This model is specifically designed to generate fMRI-conditioned feature vectors that can be used for conditioning Stable Diffusion model for fMRI- to-image generation or large language model (LLM) for fMRI-to-text generation. More specifically, MindFormer incorporates two key innovations: 1) a subject specific token that effectively capture individual differences in fMRI signals while synergistically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Neural Networks and Applications
MethodsAttention Is All You Need · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections · Absolute Position Encodings
