Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts
Thanh Thi Nguyen, Campbell Wilson, Janis Dalins

TL;DR
This paper demonstrates that fine-tuning the open-source Llama 2 7B model effectively detects online sexual predatory chats and abusive language across multiple languages, showing strong, consistent performance in real-world scenarios.
Contribution
It introduces a novel, automated approach using Llama 2 for detecting harmful online content, applicable to multiple languages and datasets, without manual feature-engineering.
Findings
High detection accuracy across diverse datasets
Effective multilingual and imbalanced data handling
Potential for broad real-world applications
Abstract
Detecting online sexual predatory behaviours and abusive language on social media platforms has become a critical area of research due to the growing concerns about online safety, especially for vulnerable populations such as children and adolescents. Researchers have been exploring various techniques and approaches to develop effective detection systems that can identify and mitigate these risks. Recent development of large language models (LLMs) has opened a new opportunity to address this problem more effectively. This paper proposes an approach to detection of online sexual predatory chats and abusive language using the open-source pretrained Llama 2 7B-parameter model, recently released by Meta GenAI. We fine-tune the LLM using datasets with different sizes, imbalance degrees, and languages (i.e., English, Roman Urdu and Urdu). Based on the power of LLMs, our approach is generic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection · Authorship Attribution and Profiling
