Rethinking Offensive Text Detection as a Multi-Hop Reasoning Problem

Qiang Zhang; Jason Naradowsky; Yusuke Miyao

arXiv:2204.10521 · cs.CL · April 25, 2022

TL;DR

This paper frames implicit offensive text detection in dialogues as a multi-hop reasoning problem. It introduces SLIGHT, a dataset with human-annotated reasoning chains, and shows that current offense detection methods perform poorly on implicitly offensive statements.

Contribution

It introduces SLIGHT, a dataset with reasoning chains for implicit offensive detection, and demonstrates the potential of multi-hop reasoning models to improve detection accuracy.

Findings

01. State-of-the-art methods achieve only ~11% accuracy on implicit offensive detection.

02. Multi-hop reasoning models can improve detection performance.

03. Analysis highlights the importance of commonsense knowledge in understanding offensive statements.

Abstract

We introduce the task of implicit offensive text detection in dialogues, where a statement may have either an offensive or non-offensive interpretation, depending on the listener and context. We argue that reasoning is crucial for understanding this broader class of offensive utterances and release SLIGHT, a dataset to support research on this task. Experiments using the data show that state-of-the-art methods of offense detection perform poorly when asked to detect implicitly offensive statements, achieving only $\sim 11\%$ accuracy. In contrast to existing offensive text detection datasets, SLIGHT features human-annotated chains of reasoning which describe the mental process by which an offensive interpretation can be reached from each ambiguous statement. We explore the potential for a multi-hop reasoning approach by utilizing existing entailment models to score the probability…
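The abstract is cut off before it says exactly what the entailment models score, so the following is only a minimal sketch of one plausible reading: each hop in a reasoning chain is scored as a premise/hypothesis pair, and the chain score is the product of the per-hop entailment probabilities. The names `chain_score`, `EntailmentScorer`, and `toy_entail_prob` are hypothetical and do not come from the paper or its repository. The word-overlap scorer is a stand-in used only to keep the example runnable, and the example chain is invented rather than drawn from SLIGHT.

```python
from math import prod
from typing import Callable, Sequence

# Hypothetical scorer type: (premise, hypothesis) -> entailment probability in [0, 1].
# In practice this would wrap a trained entailment (NLI) model.
EntailmentScorer = Callable[[str, str], float]


def chain_score(steps: Sequence[str], entail_prob: EntailmentScorer) -> float:
    """Score a reasoning chain as the product of its per-hop entailment probabilities.

    Assumption: hops are treated as independent, so their probabilities multiply.
    """
    if len(steps) < 2:
        raise ValueError("a chain needs at least two steps")
    # Pair each step with its successor: (s0, s1), (s1, s2), ...
    return prod(entail_prob(p, h) for p, h in zip(steps, steps[1:]))


def toy_entail_prob(premise: str, hypothesis: str) -> float:
    """Stand-in scorer: fraction of hypothesis words found in the premise.

    This is NOT an entailment model; it only makes the sketch executable.
    """
    p = set(premise.lower().split())
    h = set(hypothesis.lower().split())
    return len(p & h) / len(h) if h else 0.0


if __name__ == "__main__":
    # Invented chain for illustration only (not taken from SLIGHT).
    chain = [
        "you finished the race eventually",
        "you finished the race late",
        "you are slow",
    ]
    print(chain_score(chain, toy_entail_prob))
```

Under this reading, a long chain with one weak hop gets a low overall score, which is the usual trade-off of a product-based scorer; taking the minimum or a length-normalized product would be alternative design choices.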

Code & Models

Repositories

qzx7/slight (Official)

Taxonomy

Topics: Hate Speech and Cyberbullying Detection