CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding
Qiongyi Zhou, Changde Du, Shengpei Wang, Huiguang He

TL;DR
This paper introduces CLIP-MUSED, a novel multi-subject neural decoding approach that leverages CLIP-guided features, Transformer-based modeling, and subject-specific tokens to improve generalization and performance in decoding visual stimuli from neural responses.
Contribution
The paper proposes a Transformer-based multi-subject neural decoding method guided by CLIP, with learnable subject tokens and RSA, achieving state-of-the-art results and better modeling of inter-subject neural relationships.
Findings
Outperforms existing multi-subject decoding methods.
Achieves state-of-the-art performance on two fMRI datasets.
Provides visualizations that validate the effectiveness of the approach.
Abstract
The study of decoding visual neural information faces challenges in generalizing single-subject decoding models to multiple subjects, due to individual differences. Moreover, the limited availability of data from a single subject has a constraining impact on model performance. Although prior multi-subject decoding methods have made significant progress, they still suffer from several limitations, including difficulty in extracting global neural response features, linear scaling of model parameters with the number of subjects, and inadequate characterization of the relationship between neural responses of different subjects to various stimuli. To overcome these limitations, we propose a CLIP-guided Multi-sUbject visual neural information SEmantic Decoding (CLIP-MUSED) method. Our method consists of a Transformer-based feature extractor to effectively model global neural representations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Cell Image Analysis Techniques
MethodsContrastive Language-Image Pre-training
