Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual   Predatory Chats and Abusive Texts

Thanh Thi Nguyen; Campbell Wilson; Janis Dalins

arXiv:2308.14683·cs.CL·August 29, 2023·5 cites

Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts

Thanh Thi Nguyen, Campbell Wilson, Janis Dalins

PDF

Open Access

TL;DR

This paper demonstrates that fine-tuning the open-source Llama 2 7B model effectively detects online sexual predatory chats and abusive language across multiple languages, showing strong, consistent performance in real-world scenarios.

Contribution

It introduces a novel, automated approach using Llama 2 for detecting harmful online content, applicable to multiple languages and datasets, without manual feature-engineering.

Findings

01

High detection accuracy across diverse datasets

02

Effective multilingual and imbalanced data handling

03

Potential for broad real-world applications

Abstract

Detecting online sexual predatory behaviours and abusive language on social media platforms has become a critical area of research due to the growing concerns about online safety, especially for vulnerable populations such as children and adolescents. Researchers have been exploring various techniques and approaches to develop effective detection systems that can identify and mitigate these risks. Recent development of large language models (LLMs) has opened a new opportunity to address this problem more effectively. This paper proposes an approach to detection of online sexual predatory chats and abusive language using the open-source pretrained Llama 2 7B-parameter model, recently released by Meta GenAI. We fine-tune the LLM using datasets with different sizes, imbalance degrees, and languages (i.e., English, Roman Urdu and Urdu). Based on the power of LLMs, our approach is generic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection · Authorship Attribution and Profiling