Emotion Recognition with Pre-Trained Transformers Using Multimodal Signals
Juan Vazquez-Rodriguez (M-PSI), Gr\'egoire Lefebvre, Julien Cumin,, James L Crowley (M-PSI)

TL;DR
This paper explores using Transformer models pretrained on multimodal physiological signals to improve emotion recognition accuracy, demonstrating the effectiveness of pretraining and multimodal inputs on a state-of-the-art dataset.
Contribution
It introduces a Transformer-based multimodal emotion recognition framework with pretraining, advancing the state-of-the-art in physiological signal analysis.
Findings
Pretraining enhances emotion recognition performance.
Multimodal inputs outperform unimodal signals.
Transformer models are effective for physiological data analysis.
Abstract
In this paper, we address the problem of multimodal emotion recognition from multiple physiological signals. We demonstrate that a Transformer-based approach is suitable for this task. In addition, we present how such models may be pretrained in a multimodal scenario to improve emotion recognition performances. We evaluate the benefits of using multimodal inputs and pre-training with our approach on a state-ofthe-art dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · EEG and Brain-Computer Interfaces · Context-Aware Activity Recognition Systems
