AffectGPT-R1: Leveraging Reinforcement Learning for Open-Vocabulary Multimodal Emotion Recognition
Zheng Lian, Fan Zhang, Yazhou Zhang, Jianhua Tao, Rui Liu, Haoyu Chen, Xiaobai Li, Bin He

TL;DR
AffectGPT-R1 introduces a reinforcement learning approach for open-vocabulary multimodal emotion recognition, aligning training objectives with evaluation metrics and improving emotion understanding beyond traditional discriminative methods.
Contribution
This paper presents AffectGPT-R1, a novel reinforcement learning framework that optimizes emotion recognition metrics directly and incorporates explicit reasoning and auxiliary rewards for enhanced performance.
Findings
Achieves state-of-the-art results on MER-UniBench.
Significantly improves open-vocabulary emotion recognition performance.
Demonstrates the effectiveness of reinforcement learning in aligning training with evaluation metrics.
Abstract
Open-Vocabulary Multimodal Emotion Recognition (OV-MER) aims to predict emotions without being constrained by label spaces, enabling fine-grained emotion understanding. Unlike traditional discriminative methods, OV-MER leverages generative models to capture the full spectrum of emotions and employs emotion wheels (EWs) for metric calculation. Previous approaches (e.g., AffectGPT) primarily rely on token-level loss during training. However, this objective is misaligned with the metrics used in OV-MER, while these metrics cannot be optimized via gradient backpropagation. To address this limitation, we propose AffectGPT-R1, a reinforcement learning framework that treats EW-based metrics as a reward function and applies policy optimization to maximize this reward. Additionally, we introduce an explicit reasoning process and examine its necessity in OV-MER. To further guide model behavior,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Mental Health via Writing
