LLMs and Finetuning: Benchmarking cross-domain performance for hate   speech detection

Ahmad Nasir; Aadish Sharma; Kokil Jaidka; Saifuddin Ahmed

arXiv:2310.18964·cs.CL·May 1, 2025·1 cites

LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection

Ahmad Nasir, Aadish Sharma, Kokil Jaidka, Saifuddin Ahmed

PDF

Open Access

TL;DR

This paper benchmarks the cross-domain performance of large language models in hate speech detection, highlighting their advantages, limitations, and the influence of dataset features and training parameters.

Contribution

It provides a comprehensive analysis of LLMs' effectiveness in hate speech detection across domains and discusses best practices for benchmarking such models.

Findings

01

LLMs outperform state-of-the-art methods even without pretraining

02

Fine-grained labels' advantage diminishes with larger datasets

03

Limitations include issues with validity and reproducibility

Abstract

In the evolving landscape of online communication, hate speech detection remains a formidable challenge, further compounded by the diversity of digital platforms. This study investigates the effectiveness and adaptability of pre-trained and fine-tuned Large Language Models (LLMs) in identifying hate speech, to address two central questions: (1) To what extent does the model performance depend on the fine-tuning and training parameters?, (2) To what extent do models generalize to cross-domain hate speech detection? and (3) What are the specific features of the datasets or models that influence the generalization potential? The experiment shows that LLMs offer a huge advantage over the state-of-the-art even without pretraining. Ordinary least squares analyses suggest that the advantage of training with fine-grained hate speech labels is washed away with the increase in dataset size. While…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting

MethodsLLaMA