VyAnG-Net: A Novel Multi-Modal Sarcasm Recognition Model by Uncovering Visual, Acoustic and Glossary Features
Ananya Pandey, Dinesh Kumar Vishwakarma

TL;DR
VyAnG-Net is a multi-modal sarcasm recognition model that integrates visual, acoustic, and textual features using attention mechanisms, achieving state-of-the-art accuracy on benchmark datasets.
Contribution
It introduces a novel multi-modal architecture with attention-based feature extraction and fusion for sarcasm detection, outperforming existing methods.
Findings
Achieved 79.86% accuracy on MUSTaRD dataset.
Demonstrated superior performance over existing sarcasm recognition models.
Showed good adaptability in cross-dataset testing with MUStARD++.
Abstract
Various linguistic and non-linguistic clues, such as excessive emphasis on a word, a shift in the tone of voice, or an awkward expression, frequently convey sarcasm. The computer vision problem of sarcasm recognition in conversation aims to identify hidden sarcastic, criticizing, and metaphorical information embedded in everyday dialogue. Prior, sarcasm recognition has focused mainly on text. Still, it is critical to consider all textual information, audio stream, facial expression, and body position for reliable sarcasm identification. Hence, we propose a novel approach that combines a lightweight depth attention module with a self-regulated ConvNet to concentrate on the most crucial features of visual data and an attentional tokenizer based strategy to extract the most critical context-specific information from the textual data. The following is a list of the key contributions that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Wildlife Ecology and Conservation · Forensic Anthropology and Bioarchaeology Studies
MethodsSoftmax · Attention Is All You Need
