Multimodal Emotion-Cause Pair Extraction in Conversations
Fanfan Wang, Zixiang Ding, Rui Xia, Zhaoyu Li, Jianfei Yu

TL;DR
This paper introduces a new task for extracting emotion-cause pairs from multimodal conversations, presents a dataset, and establishes baseline results demonstrating the effectiveness of multimodal information fusion.
Contribution
It proposes the first multimodal emotion-cause pair extraction task, creates a new dataset, and benchmarks baseline models for this novel problem.
Findings
Multimodal features improve emotion-cause pair extraction accuracy.
The Emotion-Cause-in-Friends dataset contains 9,272 annotated pairs.
Baseline models show promising results with multimodal information fusion.
Abstract
Emotion cause analysis has received considerable attention in recent years. Previous studies primarily focused on emotion cause extraction from texts in news articles or microblogs. It is also interesting to discover emotions and their causes in conversations. As conversation in its natural form is multimodal, a large number of studies have been carried out on multimodal emotion recognition in conversations, but there is still a lack of work on multimodal emotion cause analysis. In this work, we introduce a new task named Multimodal Emotion-Cause Pair Extraction in Conversations, aiming to jointly extract emotions and their associated causes from conversations reflected in multiple modalities (text, audio and video). We accordingly construct a multimodal conversational emotion cause dataset, Emotion-Cause-in-Friends, which contains 9,272 multimodal emotion-cause pairs annotated on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Topic Modeling
