Evaluating Language Models on Grooming Risk Estimation Using Fuzzy Theory
Geetanjali Bihani, Tatiana Ringenberg, Julia Rayz

TL;DR
This paper investigates the ability of SBERT language models to assess grooming risk in conversations, highlighting challenges in detecting indirect speech and the need for more robust modeling in high-risk domains.
Contribution
The study evaluates SBERT's effectiveness in discerning grooming risk levels and identifies limitations in handling indirect speech and implicit language.
Findings
Fine-tuning improves grooming score predictions.
High variance in predictions for high-risk contexts.
Errors often occur with indirect speech and non-explicit content.
Abstract
Encoding implicit language presents a challenge for language models, especially in high-risk domains where maintaining high precision is important. Automated detection of online child grooming is one such critical domain, where predators manipulate victims using a combination of explicit and implicit language to convey harmful intentions. While recent studies have shown the potential of Transformer language models like SBERT for preemptive grooming detection, they primarily depend on surface-level features and approximate real victim grooming processes using vigilante and law enforcement conversations. The question of whether these features and approximations are reasonable has not been addressed thus far. In this paper, we address this gap and study whether SBERT can effectively discern varying degrees of grooming risk inherent in conversations, and evaluate its results across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
