Rethinking Offensive Text Detection as a Multi-Hop Reasoning Problem
Qiang Zhang, Jason Naradowsky, Yusuke Miyao

TL;DR
This paper presents a new approach to implicit offensive text detection in dialogues, emphasizing multi-hop reasoning and introducing the SLIGHT dataset with annotated reasoning chains, revealing the limitations of current methods.
Contribution
It introduces SLIGHT, a dataset with reasoning chains for implicit offensive detection, and demonstrates the potential of multi-hop reasoning models to improve detection accuracy.
Findings
State-of-the-art methods achieve only ~11% accuracy on implicit offensive detection.
Multi-hop reasoning models can improve detection performance.
Analysis highlights the importance of commonsense knowledge in understanding offensive statements.
Abstract
We introduce the task of implicit offensive text detection in dialogues, where a statement may have either an offensive or non-offensive interpretation, depending on the listener and context. We argue that reasoning is crucial for understanding this broader class of offensive utterances and release SLIGHT, a dataset to support research on this task. Experiments using the data show that state-of-the-art methods of offense detection perform poorly when asked to detect implicitly offensive statements, achieving only accuracy. In contrast to existing offensive text detection datasets, SLIGHT features human-annotated chains of reasoning which describe the mental process by which an offensive interpretation can be reached from each ambiguous statement. We explore the potential for a multi-hop reasoning approach by utilizing existing entailment models to score the probability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
