Detecting Community Sensitive Norm Violations in Online Conversations

Chan Young Park; Julia Mendelsohn; Karthik Radhakrishnan; Kinjal Jain,; Tushar Kanakagiri; David Jurgens; Yulia Tsvetkov

arXiv:2110.04419·cs.CL·October 12, 2021

Detecting Community Sensitive Norm Violations in Online Conversations

Chan Young Park, Julia Mendelsohn, Karthik Radhakrishnan, Kinjal Jain,, Tushar Kanakagiri, David Jurgens, Yulia Tsvetkov

PDF

Open Access 1 Repo

TL;DR

This paper presents a new dataset and models for detecting a broader range of community norm violations in online conversations, emphasizing context and community sensitivity to improve moderation accuracy.

Contribution

It introduces a comprehensive dataset and models that detect diverse norm violations considering local and global community contexts, advancing beyond toxicity detection.

Findings

01

Models achieve high performance in detecting norm violations.

02

Context-aware approaches outperform baseline toxicity detection methods.

03

Dataset covers a wider spectrum of community norms.

Abstract

Online platforms and communities establish their own norms that govern what behavior is acceptable within the community. Substantial effort in NLP has focused on identifying unacceptable behaviors and, recently, on forecasting them before they occur. However, these efforts have largely focused on toxicity as the sole form of community norm violation. Such focus has overlooked the much larger set of rules that moderators enforce. Here, we introduce a new dataset focusing on a more complete spectrum of community norms and their violations in the local conversational and global community contexts. We introduce a series of models that use this data to develop context- and community-sensitive norm violation detection, showing that these changes give high performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chan0park/normvio
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Software Engineering Research · Wikis in Education and Collaboration