Detecting Harmful Content On Online Platforms: What Platforms Need Vs.   Where Research Efforts Go

Arnav Arora; Preslav Nakov; Momchil Hardalov; Sheikh Muhammad Sarwar,; Vibha Nayak; Yoan Dinkov; Dimitrina Zlatkova; Kyle Dent; Ameya Bhatawdekar,; Guillaume Bouchard; Isabelle Augenstein

arXiv:2103.00153·cs.CL·June 7, 2023·5 cites

Detecting Harmful Content On Online Platforms: What Platforms Need Vs. Where Research Efforts Go

Arnav Arora, Preslav Nakov, Momchil Hardalov, Sheikh Muhammad Sarwar,, Vibha Nayak, Yoan Dinkov, Dimitrina Zlatkova, Kyle Dent, Ameya Bhatawdekar,, Guillaume Bouchard, Isabelle Augenstein

PDF

Open Access

TL;DR

This paper surveys the gap between the types of harmful online content platforms aim to control and the focus of current research efforts, highlighting areas for future investigation.

Contribution

It provides a comprehensive analysis of existing detection methods and moderation policies, identifying mismatches and suggesting future research directions.

Findings

01

Research often focuses on narrow harmful content types

02

Platforms seek to curb diverse harmful content

03

Significant gaps exist between platform needs and research efforts

Abstract

The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more inclusive environment for their users. Researchers have developed different methods for automatically detecting harmful content, often focusing on specific sub-problems or on narrow communities, as what is considered harmful often depends on the platform and on the context. We argue that there is currently a dichotomy between what types of harmful content online platforms seek to curb, and what research efforts there are to automatically detect such content. We thus survey existing methods as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Social Media and Politics