The Feasibility of Algorithmic Detection and Decentralised Moderation for Protecting Women from Online Abuse
Sarah Barrington

TL;DR
This paper explores the potential of decentralized, algorithmic moderation using multidimensional abuse indicators on Twitter to better protect women from online harassment, addressing limitations of traditional moderation methods.
Contribution
It introduces three multidimensional abuse indicators, demonstrates how to extract them from Twitter data, and proposes a framework for an end-to-end moderation algorithm targeting female-targeted online abuse.
Findings
Identified three key multidimensional abuse indicators.
Developed methods to extract these indicators from Twitter data.
Proposed a technical framework for algorithmic moderation.
Abstract
Online abuse is becoming an increasingly prevalent issue in modern-day society, with 41 percent of Americans having experienced online harassment in some capacity in 2021. People who identify as women, in particular, can be subjected to a wide range of abusive behavior online, with gender-specific experiences cited broadly in recent literature across fields such as blogging, politics, and journalism. In response to this rise in abusive content, platforms have been found to largely employ "individualistic moderation" approaches, aiming to protect users from harmful content through the screening and management of singular interactions or accounts. Yet, previous work performed by the author of this paper has shown that in the cases of women in particular, these approaches can often be ineffective; failing to protect users from multi-dimensional abuse spanning prolonged time periods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
