Demarked: A Strategy for Enhanced Abusive Speech Moderation through   Counterspeech, Detoxification, and Message Management

Seid Muhie Yimam; Daryna Dementieva; Tim Fischer; Daniil; Moskovskiy; Naquee Rizwan; Punyajoy Saha; Sarthak Roy; Martin; Semmann; Alexander Panchenko; Chris Biemann; Animesh Mukherjee

arXiv:2406.19543·cs.CL·July 1, 2024

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

Seid Muhie Yimam, Daryna Dementieva, Tim Fischer, Daniil, Moskovskiy, Naquee Rizwan, Punyajoy Saha, Sarthak Roy, Martin, Semmann, Alexander Panchenko, Chris Biemann, Animesh Mukherjee

PDF

Open Access

TL;DR

This paper introduces Demarcation, a comprehensive scoring system for abusive speech that considers severity, target presence, context, and legality, proposing tailored moderation actions beyond simple bans.

Contribution

It presents a novel multi-aspect scoring framework for abusive speech, integrating legal and contextual factors to guide nuanced moderation strategies.

Findings

01

Proposes Demarcation scoring system with four key aspects.

02

Analyzes diverse regulations and moderation practices.

03

Highlights the need for tailored, proactive moderation measures.

Abstract

Despite regulations imposed by nations and social media platforms, such as recent EU regulations targeting digital violence, abusive content persists as a significant challenge. Existing approaches primarily rely on binary solutions, such as outright blocking or banning, yet fail to address the complex nature of abusive speech. In this work, we propose a more comprehensive approach called Demarcation scoring abusive speech based on four aspect -- (i) severity scale; (ii) presence of a target; (iii) context scale; (iv) legal scale -- and suggesting more options of actions like detoxification, counter speech generation, blocking, or, as a final measure, human intervention. Through a thorough analysis of abusive speech regulations across diverse jurisdictions, platforms, and research papers we highlight the gap in preventing measures and advocate for tailored proactive steps to combat its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection