Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation   Generation

Palaash Goel; Dushyant Singh Chauhan; Md Shad Akhtar

arXiv:2502.07391·cs.CL·February 12, 2025

Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation

Palaash Goel, Dushyant Singh Chauhan, Md Shad Akhtar

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces TURBO, a novel multimodal sarcasm explanation model that effectively leverages target information and inter-modality relationships to generate more accurate and nuanced explanations, outperforming existing methods.

Contribution

The paper proposes TURBO, a shared fusion-based model that incorporates sarcasm targets into multimodal explanation generation, advancing the state-of-the-art in sarcasm understanding.

Findings

01

TURBO outperforms baselines with a +3.3% accuracy margin.

02

LLMs generate explanations but often miss sarcasm nuances.

03

Human evaluation favors TURBO's explanations over others.

Abstract

Sarcasm is a linguistic phenomenon that intends to ridicule a target (e.g., entity, event, or person) in an inherent way. Multimodal Sarcasm Explanation (MuSE) aims at revealing the intended irony in a sarcastic post using a natural language explanation. Though important, existing systems overlooked the significance of the target of sarcasm in generating explanations. In this paper, we propose a Target-aUgmented shaRed fusion-Based sarcasm explanatiOn model, aka. TURBO. We design a novel shared-fusion mechanism to leverage the inter-modality relationships between an image and its caption. TURBO assumes the target of the sarcasm and guides the multimodal shared fusion mechanism in learning intricacies of the intended irony for explanations. We evaluate our proposed TURBO model on the MORE+ dataset. Comparison against multiple baselines and state-of-the-art models signifies the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

flamenlp/TURBO
pytorchOfficial

Videos

Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications · Time Series Analysis and Forecasting