The Gray Area: Characterizing Moderator Disagreement on Reddit
Shayan Alipour, Shruti Phadke, Seyed Shahabeddin Mousavi, Amirhossein Afsharrad, Morteza Zihayat, Mattia Samory

TL;DR
This study analyzes moderator disagreements on Reddit, revealing that disputed moderation cases are common, complex, often involve automated decisions, and are challenging for AI to resolve, emphasizing the importance of human moderators.
Contribution
The paper provides a large-scale characterization of moderation disputes, highlighting the complexity of gray areas and limitations of current AI tools in resolving them.
Findings
One-in-seven moderation cases are disputed.
Gray area cases are harder to adjudicate than undisputed cases.
Language models struggle with gray area moderation decisions.
Abstract
Volunteer moderators play a crucial role in sustaining online dialogue, but they often disagree about what should or should not be allowed. In this paper, we study the complexity of content moderation with a focus on disagreements between moderators, which we term the ``gray area'' of moderation. Leveraging 5 years and 4.3 million moderation log entries from 24 subreddits of different topics and sizes, we characterize how gray area, or disputed cases, differ from undisputed cases. We show that one-in-seven moderation cases are disputed among moderators, often addressing transgressions where users' intent is not directly legible, such as in trolling and brigading, as well as tensions around community governance. This is concerning, as almost half of all gray area cases involved automated moderation decisions. Through information-theoretic evaluations, we demonstrate that gray area cases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection · Misinformation and Its Impacts
