Unsupervised Multi-Modal Representation Learning for Affective Computing with Multi-Corpus Wearable Data
Kyle Ross, Paul Hungler, Ali Etemad

TL;DR
This paper introduces an unsupervised deep learning framework using stacked autoencoders for emotion recognition from wearable biosignals, achieving higher accuracy and generalizability across multiple datasets without manual feature extraction.
Contribution
The work presents a novel unsupervised approach with stacked autoencoders for multi-modal biosignal representation, outperforming existing supervised and handcrafted feature methods.
Findings
Outperforms state-of-the-art arousal detection methods.
Effectively generalizes across multiple datasets.
Reduces reliance on manual feature extraction.
Abstract
With recent developments in smart technologies, there has been a growing focus on the use of artificial intelligence and machine learning for affective computing to further enhance the user experience through emotion recognition. Typically, machine learning models used for affective computing are trained using manually extracted features from biological signals. Such features may not generalize well for large datasets and may be sub-optimal in capturing the information from the raw input data. One approach to address this issue is to use fully supervised deep learning methods to learn latent representations of the biosignals. However, this method requires human supervision to label the data, which may be unavailable or difficult to obtain. In this work we propose an unsupervised framework reduce the reliance on human supervision. The proposed framework utilizes two stacked convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
