Graph-based Features for Automatic Online Abuse Detection

Etienne Papegnies (LIA); Vincent Labatut (LIA); Richard Dufour (LIA),; Georges Linares (LIA)

arXiv:1708.01060·cs.IR·October 17, 2017

Graph-based Features for Automatic Online Abuse Detection

Etienne Papegnies (LIA), Vincent Labatut (LIA), Richard Dufour (LIA),, Georges Linares (LIA)

PDF

TL;DR

This paper explores the use of conversational network graph features derived from chat logs to detect online abuse, achieving high classification performance comparable to content-based methods.

Contribution

It introduces a graph-based feature extraction method from chat logs for abuse detection, demonstrating its effectiveness in classification tasks.

Findings

01

Graph features significantly improve abuse detection accuracy.

02

Performance comparable to content-based approaches.

03

Graph-based approach offers robustness against message obfuscation.

Abstract

While online communities have become increasingly important over the years, the moderation of user-generated content is still performed mostly manually. Automating this task is an important step in reducing the financial cost associated with moderation, but the majority of automated approaches strictly based on message content are highly vulnerable to intentional obfuscation. In this paper, we discuss methods for extracting conversational networks based on raw multi-participant chat logs, and we study the contribution of graph features to a classification system that aims to determine if a given message is abusive. The conversational graph-based system yields unexpectedly high performance , with results comparable to those previously obtained with a content-based approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.