AffectGPT: Dataset and Framework for Explainable Multimodal Emotion   Recognition

Zheng Lian; Haiyang Sun; Licai Sun; Jiangyan Yi; Bin Liu; Jianhua Tao

arXiv:2407.07653·cs.HC·July 11, 2024·1 cites

AffectGPT: Dataset and Framework for Explainable Multimodal Emotion Recognition

Zheng Lian, Haiyang Sun, Licai Sun, Jiangyan Yi, Bin Liu, Jianhua Tao

PDF

Open Access 1 Repo

TL;DR

AffectGPT introduces a large-scale, coarsely-labeled multimodal emotion dataset and a two-stage training framework to improve explainable emotion recognition, reducing annotation costs and enhancing model performance.

Contribution

The paper presents EMER-Coarse dataset construction and a novel two-stage AffectGPT training framework for better multimodal emotion recognition.

Findings

01

AffectGPT outperforms baseline models on EMER tasks.

02

The EMER-Coarse dataset significantly expands available data.

03

Two-stage training improves alignment with manual annotations.

Abstract

Explainable Multimodal Emotion Recognition (EMER) is an emerging task that aims to achieve reliable and accurate emotion recognition. However, due to the high annotation cost, the existing dataset (denoted as EMER-Fine) is small, making it difficult to perform supervised training. To reduce the annotation cost and expand the dataset size, this paper reviews the previous dataset construction process. Then, we simplify the annotation pipeline, avoid manual checks, and replace the closed-source models with open-source models. Finally, we build \textbf{EMER-Coarse}, a coarsely-labeled dataset containing large-scale samples. Besides the dataset, we propose a two-stage training framework \textbf{AffectGPT}. The first stage exploits EMER-Coarse to learn a coarse mapping between multimodal inputs and emotion-related descriptions; the second stage uses EMER-Fine to better align with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zeroqiaoba/affectgpt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition