Privacy Enhanced Multimodal Neural Representations for Emotion Recognition
Mimansa Jaiswal, Emily Mower Provost

TL;DR
This paper investigates how multimodal emotion recognition models can unintentionally leak demographic information and proposes an adversarial learning approach to enhance privacy without significantly reducing emotion recognition accuracy.
Contribution
It introduces a novel adversarial training method to unlearn demographic information in multimodal representations for emotion recognition, addressing privacy concerns across different data modalities.
Findings
Adversarial training improves privacy metrics across modalities.
The method maintains primary task performance with minimal accuracy loss.
Different modalities exhibit varying levels of demographic information leakage.
Abstract
Many mobile applications and virtual conversational agents now aim to recognize and adapt to emotions. To enable this, data are transmitted from users' devices and stored on central servers. Yet, these data contain sensitive information that could be used by mobile applications without user's consent or, maliciously, by an eavesdropping adversary. In this work, we show how multimodal representations trained for a primary task, here emotion recognition, can unintentionally leak demographic information, which could override a selected opt-out option by the user. We analyze how this leakage differs in representations obtained from textual, acoustic, and multimodal data. We use an adversarial learning paradigm to unlearn the private information present in a representation and investigate the effect of varying the strength of the adversarial component on the primary task and on the privacy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
