A Fuzzy Evaluation of Sentence Encoders on Grooming Risk Classification

Geetanjali Bihani; Julia Rayz

arXiv:2502.12576·cs.CL·February 19, 2025

A Fuzzy Evaluation of Sentence Encoders on Grooming Risk Classification

Geetanjali Bihani, Julia Rayz

PDF

Open Access

TL;DR

This study evaluates sentence encoders for grooming risk classification in online chats, revealing that models struggle with indirect and coded language, emphasizing the need for more robust detection methods in social media safety.

Contribution

The paper introduces a fuzzy evaluation framework to assess how well models align with human perceptions of grooming, especially in coded language scenarios, highlighting limitations of current fine-tuned transformers.

Findings

01

Models fail to detect indirect and coded grooming language

02

High out-of-vocabulary words correlate with misclassification

03

Current models need improvement for noisy chat inputs

Abstract

With the advent of social media, children are becoming increasingly vulnerable to the risk of grooming in online settings. Detecting grooming instances in an online conversation poses a significant challenge as the interactions are not necessarily sexually explicit, since the predators take time to build trust and a relationship with their victim. Moreover, predators evade detection using indirect and coded language. While previous studies have fine-tuned Transformers to automatically identify grooming in chat conversations, they overlook the impact of coded and indirect language on model predictions, and how these align with human perceptions of grooming. In this paper, we address this gap and evaluate bi-encoders on the task of classifying different degrees of grooming risk in chat contexts, for three different participant groups, i.e. law enforcement officers, real victims, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Advanced Malware Detection Techniques · Sentiment Analysis and Opinion Mining

MethodsALIGN