In-Context Examples Matter: Improving Emotion Recognition in Conversation with Instruction Tuning
Hui Ma, Bo Zhang, Jinpeng Hu, Zenglin Shi

TL;DR
This paper introduces InitERC, a one-stage in-context instruction tuning framework for emotion recognition in conversation, which effectively aligns speaker, context, and emotion cues, outperforming multi-stage methods.
Contribution
Proposes InitERC, a novel one-stage in-context instruction tuning approach that improves emotion recognition by better aligning speaker, context, and emotion information in conversations.
Findings
InitERC significantly outperforms existing baselines.
In-context example selection impacts model performance.
Number and order of examples influence emotion recognition accuracy.
Abstract
Emotion recognition in conversation (ERC) aims to identify the emotion of each utterance in a conversation, playing a vital role in empathetic artificial intelligence. With the growing of large language models (LLMs), instruction tuning has emerged as a critical paradigm for ERC. Existing studies mainly focus on multi-stage instruction tuning, which first endows LLMs with speaker characteristics, and then conducts context-aware instruction tuning to comprehend emotional states. However, these methods inherently constrains the capacity to jointly capture the dynamic interaction between speaker characteristics and conversational context, resulting in weak alignment among speaker identity, contextual cues, and emotion states within a unified framework. In this paper, we propose InitERC, a simple yet effective one-stage in-context instruction tuning framework for ERC. InitERC adapts LLMs to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Social Robot Interaction and HRI · Emotion and Mood Recognition
