Finetuning for Sarcasm Detection with a Pruned Dataset
Ishita Goyal, Priyank Bhandia, Sanjana Dulam

TL;DR
This paper improves sarcasm detection by fine-tuning a RoBERTa model with a significantly reduced Reddit dataset, achieving near state-of-the-art performance on the iSarcasm dataset.
Contribution
It introduces a pruned, smaller version of the Reddit dataset to enhance sarcasm detection models without extensive data requirements.
Findings
Achieved within 0.02 F1 of state-of-the-art performance.
Used a dataset 100 times smaller than previous training data.
Demonstrated effective fine-tuning with reduced dataset size.
Abstract
Sarcasm is a form of irony that involves saying or writing something that is opposite or opposite to what one really means, often in a humorous or mocking way. It is often used to mock or mock someone or something, or to be humorous or amusing. Sarcasm is usually conveyed through tone of voice, facial expressions, or other forms of nonverbal communication, but it can also be indicated by the use of certain words or phrases that are typically associated with irony or humor. Sarcasm detection is difficult because it relies on context and non-verbal cues. It can also be culturally specific, subjective and ambiguous. In this work, we fine-tune the RoBERTa based sarcasm detection model presented in Abaskohi et al. [2022] to get to within 0.02 F1 of the state-of-the-art (Hercog et al. [2022]) on the iSarcasm dataset (Oprea and Magdy [2019]). This performance is achieved by augmenting iSarcasm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage, Metaphor, and Cognition · Sentiment Analysis and Opinion Mining · Linguistics and Discourse Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Dense Connections · Linear Warmup With Linear Decay · Softmax · WordPiece · Layer Normalization · Adam
