Evaluating Language Models on Grooming Risk Estimation Using Fuzzy   Theory

Geetanjali Bihani; Tatiana Ringenberg; Julia Rayz

arXiv:2502.12563·cs.CL·February 19, 2025

Evaluating Language Models on Grooming Risk Estimation Using Fuzzy Theory

Geetanjali Bihani, Tatiana Ringenberg, Julia Rayz

PDF

Open Access

TL;DR

This paper investigates the ability of SBERT language models to assess grooming risk in conversations, highlighting challenges in detecting indirect speech and the need for more robust modeling in high-risk domains.

Contribution

The study evaluates SBERT's effectiveness in discerning grooming risk levels and identifies limitations in handling indirect speech and implicit language.

Findings

01

Fine-tuning improves grooming score predictions.

02

High variance in predictions for high-risk contexts.

03

Errors often occur with indirect speech and non-explicit content.

Abstract

Encoding implicit language presents a challenge for language models, especially in high-risk domains where maintaining high precision is important. Automated detection of online child grooming is one such critical domain, where predators manipulate victims using a combination of explicit and implicit language to convey harmful intentions. While recent studies have shown the potential of Transformer language models like SBERT for preemptive grooming detection, they primarily depend on surface-level features and approximate real victim grooming processes using vigilante and law enforcement conversations. The question of whether these features and approximations are reasonable has not been addressed thus far. In this paper, we address this gap and study whether SBERT can effectively discern varying degrees of grooming risk inherent in conversations, and evaluate its results across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection