MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding

Inhwa Han; Jaayeon Lee; Jong Chul Ye

arXiv:2405.17720·cs.CV·October 8, 2024·1 cites

MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding

Inhwa Han, Jaayeon Lee, Jong Chul Ye

PDF

Open Access

TL;DR

MindFormer is a novel model that aligns multi-subject fMRI signals to improve brain decoding, enabling more accurate fMRI-to-image and fMRI-to-text generation across individuals.

Contribution

The paper introduces MindFormer, a semantic alignment method with subject-specific tokens and a new embedding scheme, advancing multi-subject fMRI decoding capabilities.

Findings

01

Generates semantically consistent images and texts across subjects

02

Surpasses existing models in multi-subject brain decoding accuracy

03

Maintains high semantic fidelity across diverse subjects

Abstract

Research efforts for visual decoding from fMRI signals have attracted considerable attention in research community. Still multi-subject fMRI decoding with one model has been considered intractable due to the drastic variations in fMRI signals between subjects and even within the same subject across different trials. To address current limitations in multi-subject brain decoding, here we introduce a novel semantic alignment method of multi-subject fMRI signals using so-called MindFormer. This model is specifically designed to generate fMRI-conditioned feature vectors that can be used for conditioning Stable Diffusion model for fMRI- to-image generation or large language model (LLM) for fMRI-to-text generation. More specifically, MindFormer incorporates two key innovations: 1) a subject specific token that effectively capture individual differences in fMRI signals while synergistically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Neural Networks and Applications

MethodsAttention Is All You Need · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections · Absolute Position Encodings