MindSemantix: Deciphering Brain Visual Experiences with a Brain-Language   Model

Ziqi Ren; Jie Li; Xuetong Xue; Xin Li; Fan Yang; Zhicheng Jiao; Xinbo; Gao

arXiv:2405.18812·cs.CV·May 30, 2024

MindSemantix: Deciphering Brain Visual Experiences with a Brain-Language Model

Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo, Gao

PDF

Open Access

TL;DR

MindSemantix is a novel brain-language model that decodes brain activity into meaningful captions of visual experiences using a multi-modal framework, enhancing interpretability and downstream applications in neuroscience.

Contribution

The paper introduces MindSemantix, integrating LLMs with brain activity analysis through a Brain-Text Transformer and pre-trained brain encoder, advancing brain decoding and captioning capabilities.

Findings

01

Substantial quantitative improvements over prior methods

02

Effective multi-modal alignment of brain, vision, and language

03

Enhanced generalizability of neural representations

Abstract

Deciphering the human visual experience through brain activities captured by fMRI represents a compelling and cutting-edge challenge in the field of neuroscience research. Compared to merely predicting the viewed image itself, decoding brain activity into meaningful captions provides a higher-level interpretation and summarization of visual information, which naturally enhances the application flexibility in real-world situations. In this work, we introduce MindSemantix, a novel multi-modal framework that enables LLMs to comprehend visually-evoked semantic content in brain activity. Our MindSemantix explores a more ideal brain captioning paradigm by weaving LLMs into brain activity analysis, crafting a seamless, end-to-end Brain-Language Model. To effectively capture semantic information from brain responses, we propose Brain-Text Transformer, utilizing a Brain Q-Former as its core…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces

MethodsLinear Layer · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections