Can We Automate the Analysis of Online Child Sexual Exploitation   Discourse?

Darren Cook; Miri Zilka; Heidi DeSandre; Susan Giles; Adrian Weller,; Simon Maskell

arXiv:2209.12320·cs.CL·September 27, 2022·1 cites

Can We Automate the Analysis of Online Child Sexual Exploitation Discourse?

Darren Cook, Miri Zilka, Heidi DeSandre, Susan Giles, Adrian Weller,, Simon Maskell

PDF

Open Access

TL;DR

This study explores the potential of automated natural language processing methods to identify predatory behaviors in online chats between minors and adults, aiming to replace manual expert annotation and improve online child safety monitoring.

Contribution

It introduces a framework for automatically detecting grooming behaviors in chat messages, informed by psychological theories, and evaluates the performance of NLP models against human annotations.

Findings

01

Models show consistent classification of behaviors

02

Automated methods outperform random baselines

03

Approach aligns with human annotations but needs improvement

Abstract

Social media's growing popularity raises concerns around children's online safety. Interactions between minors and adults with predatory intentions is a particularly grave concern. Research into online sexual grooming has often relied on domain experts to manually annotate conversations, limiting both scale and scope. In this work, we test how well-automated methods can detect conversational behaviors and replace an expert human annotator. Informed by psychological theories of online grooming, we label $6772$ chat messages sent by child-sex offenders with one of eleven predatory behaviors. We train bag-of-words and natural language inference models to classify each behavior, and show that the best performing models classify behaviors in a manner that is consistent, but not on-par, with human annotation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Cybercrime and Law Enforcement Studies · Spam and Phishing Detection

MethodsTest