Emotion Recognition from Multiple Modalities: Fundamentals and   Methodologies

Sicheng Zhao; Guoli Jia; Jufeng Yang; Guiguang Ding; Kurt Keutzer

arXiv:2108.10152·eess.SP·November 10, 2021

Emotion Recognition from Multiple Modalities: Fundamentals and Methodologies

Sicheng Zhao, Guoli Jia, Jufeng Yang, Guiguang Ding, Kurt Keutzer

PDF

TL;DR

This paper provides a comprehensive overview of multi-modal emotion recognition, covering fundamental models, methodologies, challenges, and applications in enabling machines to interpret human emotions across various modalities.

Contribution

It offers a systematic summary of existing approaches, challenges, and future directions in multi-modal emotion recognition, serving as a foundational tutorial in the field.

Findings

01

Summarizes key emotion representation models and affective modalities.

02

Reviews strategies for emotion annotation and computational tasks.

03

Discusses representation learning, feature fusion, and domain adaptation techniques.

Abstract

Humans are emotional creatures. Multiple modalities are often involved when we express emotions, whether we do so explicitly (e.g., facial expression, speech) or implicitly (e.g., text, image). Enabling machines to have emotional intelligence, i.e., recognizing, interpreting, processing, and simulating emotions, is becoming increasingly important. In this tutorial, we discuss several key aspects of multi-modal emotion recognition (MER). We begin with a brief introduction on widely used emotion representation models and affective modalities. We then summarize existing emotion annotation strategies and corresponding computational tasks, followed by the description of main challenges in MER. Furthermore, we present some representative approaches on representation learning of each affective modality, feature fusion of different affective modalities, classifier optimization for MER, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.