Really? Well. Apparently Bootstrapping Improves the Performance of   Sarcasm and Nastiness Classifiers for Online Dialogue

Stephanie Lukin; Marilyn Walker

arXiv:1708.08572·cs.CL·August 30, 2017·24 cites

Really? Well. Apparently Bootstrapping Improves the Performance of Sarcasm and Nastiness Classifiers for Online Dialogue

Stephanie Lukin, Marilyn Walker

PDF

Open Access

TL;DR

This paper demonstrates that bootstrapping methods can significantly improve classifiers for detecting sarcasm and nastiness in online dialogue, addressing challenges posed by social media's conversational nature.

Contribution

It adapts and tests a bootstrapping approach from monologic NLP to dialogic contexts, enhancing sarcasm and nastiness detection with improved precision and recall.

Findings

01

Best sarcasm classifier: 62% precision, 52% recall.

02

Nastiness classifier: 75% precision, 62% recall.

03

Bootstrapping improves classifier performance in social media dialogue.

Abstract

More and more of the information on the web is dialogic, from Facebook newsfeeds, to forum conversations, to comment threads on news articles. In contrast to traditional, monologic Natural Language Processing resources such as news, highly social dialogue is frequent in social media, making it a challenging context for NLP. This paper tests a bootstrapping method, originally proposed in a monologic domain, to train classifiers to identify two different types of subjective language in dialogue: sarcasm and nastiness. We explore two methods of developing linguistic indicators to be used in a first level classifier aimed at maximizing precision at the expense of recall. The best performing classifier for the first phase achieves 54% precision and 38% recall for sarcastic utterances. We then use general syntactic patterns from previous work to create more general sarcasm indicators,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Speech and dialogue systems · Topic Modeling