MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding
Weikang Qiu, Zheng Huang, Haoyu Hu, Aosong Feng, Yujun Yan, Rex Ying

TL;DR
MindLLM is a versatile, subject-agnostic model that decodes fMRI signals into text using neuroscience-informed attention and Brain Instruction Tuning, outperforming baselines and providing interpretability.
Contribution
We introduce MindLLM, a novel fMRI-to-text decoding model with neuroscience-informed attention and Brain Instruction Tuning for improved versatility and generalization.
Findings
Outperforms baselines with 12% improvement in downstream tasks
Achieves 24.5% better generalization to unseen subjects
Enables 25% better adaptation to new tasks
Abstract
Decoding functional magnetic resonance imaging (fMRI) signals into text has been a key challenge in the neuroscience community, with the potential to advance brain-computer interfaces and uncover deeper insights into brain mechanisms. However, existing approaches often struggle with suboptimal predictive performance, limited task variety, and poor generalization across subjects. In response to this, we propose MindLLM, a model designed for subject-agnostic and versatile fMRI-to-text decoding. MindLLM consists of an fMRI encoder and an off-the-shelf LLM. The fMRI encoder employs a neuroscience-informed attention mechanism, which is capable of accommodating subjects with varying input shapes and thus achieves high-performance subject-agnostic decoding. Moreover, we introduce Brain Instruction Tuning (BIT), a novel approach that enhances the model's ability to capture diverse semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces · Neurobiology of Language and Bilingualism
MethodsSoftmax · Attention Is All You Need
