TL;DR
FEEL is a comprehensive benchmarking study evaluating emotion recognition models across diverse physiological datasets, highlighting the importance of domain knowledge and heterogeneity for model generalization.
Contribution
This study introduces FEEL, the first large-scale benchmark for emotion recognition using physiological signals across multiple datasets and modeling approaches.
Findings
Contrastive signal-language pretraining models achieve highest F1 scores.
Handcrafted features outperform raw signal models in low-resource settings.
Models trained on real-life data generalize well to lab and wearable device datasets.
Abstract
Emotion recognition from physiological signals has substantial potential for applications in mental health and emotion-aware systems. However, the lack of standardized, large-scale evaluations across heterogeneous datasets limits progress and model generalization. We introduce FEEL, the first large-scale benchmarking study of emotion recognition using electrodermal activity (EDA) and photoplethysmography (PPG) signals across 19 publicly available datasets. We evaluate 16 architectures spanning traditional machine learning, deep learning, and self-supervised pretraining approaches, structured into four representative modeling paradigms. Our study includes both within-dataset and cross-dataset evaluations, analyzing generalization across variations in experimental settings, device types, and labeling strategies. Our results showed that fine-tuned contrastive signal-language pretraining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
