An Investigation of Large Language Models for Real-World Hate Speech   Detection

Keyan Guo; Alexander Hu; Jaden Mu; Ziheng Shi; Ziming Zhao; and Nishant Vishwamitra; Hongxin Hu

arXiv:2401.03346·cs.CY·January 9, 2024·2 cites

An Investigation of Large Language Models for Real-World Hate Speech Detection

Keyan Guo, Alexander Hu, Jaden Mu, Ziheng Shi, Ziming Zhao, and Nishant Vishwamitra, Hongxin Hu

PDF

Open Access

TL;DR

This paper explores the use of large language models with innovative prompting strategies for more accurate, context-aware hate speech detection, outperforming traditional models across multiple datasets.

Contribution

It introduces four novel prompting strategies for LLMs, demonstrating their effectiveness in capturing context and improving hate speech detection accuracy.

Findings

01

LLMs often outperform traditional machine learning models in hate speech detection

02

A well-designed reasoning prompt significantly enhances LLM performance

03

Prompting strategies are crucial for leveraging LLMs' knowledge in context-aware detection

Abstract

Hate speech has emerged as a major problem plaguing our social spaces today. While there have been significant efforts to address this problem, existing methods are still significantly limited in effectively detecting hate speech online. A major limitation of existing methods is that hate speech detection is a highly contextual problem, and these methods cannot fully capture the context of hate speech to make accurate predictions. Recently, large language models (LLMs) have demonstrated state-of-the-art performance in several natural language tasks. LLMs have undergone extensive training using vast amounts of natural language data, enabling them to grasp intricate contextual details. Hence, they could be used as knowledge bases for context-aware hate speech detection. However, a fundamental problem with using LLMs to detect hate speech is that there are no studies on effectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting

MethodsBalanced Selection