PPG-Based Affect Recognition with Long-Range Deep Models: A Measurement-Driven Comparison of CNN, Transformer, and Mamba Architectures
Karim Alghoul, Hussein Al Osman, Abdulmotaleb El Saddik

TL;DR
This study compares CNN, Transformer, and Mamba deep learning models for PPG-based affect recognition, finding CNNs most effective overall, with Transformers offering a better balance of F1 scores.
Contribution
First comprehensive evaluation of Transformer and Mamba architectures for PPG affect recognition, providing practical insights for wearable affective computing.
Findings
Transformers and Mamba achieve comparable performance to CNNs.
CNNs provide the highest accuracy and smallest size.
Transformers balance F1 scores for Arousal and Relaxation.
Abstract
Photoplethysmography (PPG) is increasingly used in wearable affective computing due to its low cost and ease of integration into consumer devices. Recent advances in deep learning have introduced long-range sequence models, such as Transformers, and state-space models, like Mamba, which have demonstrated strong performance on natural language and general time-series tasks. However, it remains unclear whether these architectures offer tangible benefits over widely used Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTMs) for PPG-based affect recognition, given that datasets are typically small and noisy. This work presents a measurement-driven comparison of four deep learning architectures, CNN, CNN-LSTM hybrid, Transformers, and Mamba, for classifying arousal, valence, and relaxation states from wrist-based PPG signals. All models are evaluated under a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
