Multi-Modal Sarcasm Detection Based on Contrastive Attention Mechanism
Xiaoqiang Zhang, Ying Chen, Guangyuan Li

TL;DR
This paper introduces ConAttSD, a multi-modal sarcasm detection model that leverages contrastive attention to identify incongruity between modalities, improving detection accuracy in video conversations.
Contribution
It proposes a novel contrastive attention mechanism for multi-modal sarcasm detection, addressing modality incongruity more effectively than previous methods.
Findings
Outperforms existing models on MUStARD dataset
Effectively captures modality incongruity for sarcasm detection
Demonstrates significant improvement in detection accuracy
Abstract
In the past decade, sarcasm detection has been intensively conducted in a textual scenario. With the popularization of video communication, the analysis in multi-modal scenarios has received much attention in recent years. Therefore, multi-modal sarcasm detection, which aims at detecting sarcasm in video conversations, becomes increasingly hot in both the natural language processing community and the multi-modal analysis community. In this paper, considering that sarcasm is often conveyed through incongruity between modalities (e.g., text expressing a compliment while acoustic tone indicating a grumble), we construct a Contras-tive-Attention-based Sarcasm Detection (ConAttSD) model, which uses an inter-modality contrastive attention mechanism to extract several contrastive features for an utterance. A contrastive feature represents the incongruity of information between two modalities.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Language, Metaphor, and Cognition · Topic Modeling
