Cross-domain EEG-based Emotion Recognition with Contrastive Learning
Rui Yan, Yibo Li, Han Ding, Fei Wang

TL;DR
This paper presents EmotionCLIP, a novel EEG-text matching framework using contrastive learning and a specialized backbone, achieving state-of-the-art cross-domain emotion recognition accuracy on benchmark datasets.
Contribution
It introduces a new multimodal contrastive learning approach with a tailored backbone for improved cross-domain EEG emotion recognition.
Findings
Achieved cross-subject accuracy of 88.69% on SEED.
Achieved cross-time accuracy of 88.46% on SEED.
Outperformed existing models in experiments.
Abstract
Electroencephalogram (EEG)-based emotion recognition is vital for affective computing but faces challenges in feature utilization and cross-domain generalization. This work introduces EmotionCLIP, which reformulates recognition as an EEG-text matching task within the CLIP framework. A tailored backbone, SST-LegoViT, captures spatial, spectral, and temporal features using multi-scale convolution and Transformer modules. Experiments on SEED and SEED-IV datasets show superior cross-subject accuracies of 88.69\% and 73.50\%, and cross-time accuracies of 88.46\% and 77.54\%, outperforming existing models. Results demonstrate the effectiveness of multimodal contrastive learning for robust EEG emotion recognition. The code is available at https://github.com/Departure2021/EmotionCLIP.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · EEG and Brain-Computer Interfaces
