Really? Well. Apparently Bootstrapping Improves the Performance of Sarcasm and Nastiness Classifiers for Online Dialogue
Stephanie Lukin, Marilyn Walker

TL;DR
This paper demonstrates that bootstrapping methods can significantly improve classifiers for detecting sarcasm and nastiness in online dialogue, addressing challenges posed by social media's conversational nature.
Contribution
It adapts and tests a bootstrapping approach from monologic NLP to dialogic contexts, enhancing sarcasm and nastiness detection with improved precision and recall.
Findings
Best sarcasm classifier: 62% precision, 52% recall.
Nastiness classifier: 75% precision, 62% recall.
Bootstrapping improves classifier performance in social media dialogue.
Abstract
More and more of the information on the web is dialogic, from Facebook newsfeeds, to forum conversations, to comment threads on news articles. In contrast to traditional, monologic Natural Language Processing resources such as news, highly social dialogue is frequent in social media, making it a challenging context for NLP. This paper tests a bootstrapping method, originally proposed in a monologic domain, to train classifiers to identify two different types of subjective language in dialogue: sarcasm and nastiness. We explore two methods of developing linguistic indicators to be used in a first level classifier aimed at maximizing precision at the expense of recall. The best performing classifier for the first phase achieves 54% precision and 38% recall for sarcastic utterances. We then use general syntactic patterns from previous work to create more general sarcasm indicators,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Speech and dialogue systems · Topic Modeling
