Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management
Seid Muhie Yimam, Daryna Dementieva, Tim Fischer, Daniil, Moskovskiy, Naquee Rizwan, Punyajoy Saha, Sarthak Roy, Martin, Semmann, Alexander Panchenko, Chris Biemann, Animesh Mukherjee

TL;DR
This paper introduces Demarcation, a comprehensive scoring system for abusive speech that considers severity, target presence, context, and legality, proposing tailored moderation actions beyond simple bans.
Contribution
It presents a novel multi-aspect scoring framework for abusive speech, integrating legal and contextual factors to guide nuanced moderation strategies.
Findings
Proposes Demarcation scoring system with four key aspects.
Analyzes diverse regulations and moderation practices.
Highlights the need for tailored, proactive moderation measures.
Abstract
Despite regulations imposed by nations and social media platforms, such as recent EU regulations targeting digital violence, abusive content persists as a significant challenge. Existing approaches primarily rely on binary solutions, such as outright blocking or banning, yet fail to address the complex nature of abusive speech. In this work, we propose a more comprehensive approach called Demarcation scoring abusive speech based on four aspect -- (i) severity scale; (ii) presence of a target; (iii) context scale; (iv) legal scale -- and suggesting more options of actions like detoxification, counter speech generation, blocking, or, as a final measure, human intervention. Through a thorough analysis of abusive speech regulations across diverse jurisdictions, platforms, and research papers we highlight the gap in preventing measures and advocate for tailored proactive steps to combat its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
