Language Models for Adult Service Website Text Analysis
Nickolas Freeman, Thanh Nguyen, Gregory Bott, Jason Parton, Collin Francel

TL;DR
This study evaluates various language modeling techniques for analyzing adult service website texts linked to sex trafficking, demonstrating that custom transformer models outperform existing models and can be efficiently trained with limited resources.
Contribution
The paper introduces custom transformer models tailored for ASW text analysis, showing they outperform well-known models and can be trained efficiently with small GPU resources.
Findings
Custom transformer models outperform BERT, RoBERTa, and ModernBERT in accuracy and recall.
Efficient training of models with limited GPU resources is feasible.
Models enable advanced analysis like graph decomposition, clustering, and emoji usage understanding.
Abstract
Sex trafficking refers to the use of force, fraud, or coercion to compel an individual to perform in commercial sex acts against their will. Adult service websites (ASWs) have and continue to be linked to sex trafficking, offering a platform for traffickers to advertise their victims. Thus, organizations involved in the fight against sex trafficking often use ASW data when attempting to identify potential sex trafficking victims. A critical challenge in transforming ASW data into actionable insight is text analysis. Previous research using ASW data has shown that ASW ad text is important for linking ads. However, working with this text is challenging due to its extensive use of emojis, poor grammar, and deliberate obfuscation to evade law enforcement scrutiny. We conduct a comprehensive study of language modeling approaches for this application area, including simple information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Web Data Mining and Analysis
