Controlling for Confounders in Multimodal Emotion Classification via Adversarial Learning
Mimansa Jaiswal, Zakaria Aldeneh, Emily Mower Provost

TL;DR
This paper investigates how stress affects multimodal emotion recognition and demonstrates that controlling for stress via adversarial learning improves model generalizability across different domains.
Contribution
It introduces an adversarial learning approach to decorrelate stress from emotion representations, enhancing the robustness of emotion recognition models.
Findings
Stress influences acoustic and lexical emotion predictions.
Models controlling for stress generalize better across domains.
Stress encoding varies across emotion levels and modalities.
Abstract
Various psychological factors affect how individuals express emotions. Yet, when we collect data intended for use in building emotion recognition systems, we often try to do so by creating paradigms that are designed just with a focus on eliciting emotional behavior. Algorithms trained with these types of data are unlikely to function outside of controlled environments because our emotions naturally change as a function of these other factors. In this work, we study how the multimodal expressions of emotion change when an individual is under varying levels of stress. We hypothesize that stress produces modulations that can hide the true underlying emotions of individuals and that we can make emotion recognition algorithms more generalizable by controlling for variations in stress. To this end, we use adversarial networks to decorrelate stress modulations from emotion representations. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
