An Investigation of Large Language Models for Real-World Hate Speech Detection
Keyan Guo, Alexander Hu, Jaden Mu, Ziheng Shi, Ziming Zhao, and Nishant Vishwamitra, Hongxin Hu

TL;DR
This paper explores the use of large language models with innovative prompting strategies for more accurate, context-aware hate speech detection, outperforming traditional models across multiple datasets.
Contribution
It introduces four novel prompting strategies for LLMs, demonstrating their effectiveness in capturing context and improving hate speech detection accuracy.
Findings
LLMs often outperform traditional machine learning models in hate speech detection
A well-designed reasoning prompt significantly enhances LLM performance
Prompting strategies are crucial for leveraging LLMs' knowledge in context-aware detection
Abstract
Hate speech has emerged as a major problem plaguing our social spaces today. While there have been significant efforts to address this problem, existing methods are still significantly limited in effectively detecting hate speech online. A major limitation of existing methods is that hate speech detection is a highly contextual problem, and these methods cannot fully capture the context of hate speech to make accurate predictions. Recently, large language models (LLMs) have demonstrated state-of-the-art performance in several natural language tasks. LLMs have undergone extensive training using vast amounts of natural language data, enabling them to grasp intricate contextual details. Hence, they could be used as knowledge bases for context-aware hate speech detection. However, a fundamental problem with using LLMs to detect hate speech is that there are no studies on effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting
MethodsBalanced Selection
