The Unappreciated Role of Intent in Algorithmic Moderation of Social   Media Content

Xinyu Wang; Sai Koneru; Pranav Narayanan Venkit; Brett Frischmann,; Sarah Rajtmajer

arXiv:2405.11030·cs.CL·May 21, 2024

The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content

Xinyu Wang, Sai Koneru, Pranav Narayanan Venkit, Brett Frischmann,, Sarah Rajtmajer

PDF

Open Access

TL;DR

This paper highlights the importance of understanding user intent in social media content moderation, reviews current detection models and datasets, and proposes design improvements for ethical and policy alignment.

Contribution

It critically analyzes the gap between platform policies considering intent and current detection models, proposing strategic design changes to better incorporate intent understanding.

Findings

01

Current models lack effective intent detection capabilities

02

Benchmark datasets do not adequately capture intent aspects

03

Proposed design strategies aim to improve ethical alignment

Abstract

As social media has become a predominant mode of communication globally, the rise of abusive content threatens to undermine civil discourse. Recognizing the critical nature of this issue, a significant body of research has been dedicated to developing language models that can detect various types of online abuse, e.g., hate speech, cyberbullying. However, there exists a notable disconnect between platform policies, which often consider the author's intention as a criterion for content moderation, and the current capabilities of detection models, which typically lack efforts to capture intent. This paper examines the role of intent in content moderation systems. We review state of the art detection models and benchmark training datasets for online abuse to assess their awareness and ability to capture intent. We propose strategic changes to the design and development of automated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection