Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank, Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia

TL;DR
This paper benchmarks and extends the MUStARD++ multimodal sarcasm detection dataset, improving performance through advanced encoders and addressing class imbalance with a new dataset extension from House MD.
Contribution
It provides a comprehensive benchmark of state-of-the-art multimodal encoders on MUStARD++, introduces MUStARD++ Balanced to address sarcasm type imbalance, and adds diverse new data from House MD.
Findings
Achieved 2% macro-F1 improvement over existing benchmarks.
Further 2.4% macro-F1 boost with MUStARD++ Balanced.
Added diverse clips from House MD to enhance dataset diversity.
Abstract
The introduction of the MUStARD dataset, and its emotion recognition extension MUStARD++, have identified sarcasm to be a multi-modal phenomenon -- expressed not only in natural language text, but also through manners of speech (like tonality and intonation) and visual cues (facial expression). With this work, we aim to perform a rigorous benchmarking of the MUStARD++ dataset by considering state-of-the-art language, speech, and visual encoders, for fully utilizing the totality of the multi-modal richness that it has to offer, achieving a 2\% improvement in macro-F1 over the existing benchmark. Additionally, to cure the imbalance in the `sarcasm type' category in MUStARD++, we propose an extension, which we call \emph{MUStARD++ Balanced}, benchmarking the same with instances from the extension split across both train and test sets, achieving a further 2.4\% macro-F1 boost. The new clips…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Emotion and Mood Recognition · Humor Studies and Applications
