CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic   Decoding

Qiongyi Zhou; Changde Du; Shengpei Wang; Huiguang He

arXiv:2402.08994·cs.CV·February 15, 2024·2 cites

CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding

Qiongyi Zhou, Changde Du, Shengpei Wang, Huiguang He

PDF

Open Access 1 Repo

TL;DR

This paper introduces CLIP-MUSED, a novel multi-subject neural decoding approach that leverages CLIP-guided features, Transformer-based modeling, and subject-specific tokens to improve generalization and performance in decoding visual stimuli from neural responses.

Contribution

The paper proposes a Transformer-based multi-subject neural decoding method guided by CLIP, with learnable subject tokens and RSA, achieving state-of-the-art results and better modeling of inter-subject neural relationships.

Findings

01

Outperforms existing multi-subject decoding methods.

02

Achieves state-of-the-art performance on two fMRI datasets.

03

Provides visualizations that validate the effectiveness of the approach.

Abstract

The study of decoding visual neural information faces challenges in generalizing single-subject decoding models to multiple subjects, due to individual differences. Moreover, the limited availability of data from a single subject has a constraining impact on model performance. Although prior multi-subject decoding methods have made significant progress, they still suffer from several limitations, including difficulty in extracting global neural response features, linear scaling of model parameters with the number of subjects, and inadequate characterization of the relationship between neural responses of different subjects to various stimuli. To overcome these limitations, we propose a CLIP-guided Multi-sUbject visual neural information SEmantic Decoding (CLIP-MUSED) method. Our method consists of a Transformer-based feature extractor to effectively model global neural representations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clip-mused/clip-mused
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Cell Image Analysis Techniques

MethodsContrastive Language-Image Pre-training