Guardians of Discourse: Evaluating LLMs on Multilingual Offensive   Language Detection

Jianfei He; Lilin Wang; Jiaying Wang; Zhenyu Liu; Hongbin Na; Zimu; Wang; Wei Wang; Qi Chen

arXiv:2410.15623·cs.CL·October 22, 2024

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Jianfei He, Lilin Wang, Jiaying Wang, Zhenyu Liu, Hongbin Na, Zimu, Wang, Wei Wang, Qi Chen

PDF

Open Access

TL;DR

This paper evaluates the effectiveness of large language models in detecting offensive language across English, Spanish, and German, highlighting the influence of prompt language, translation data, and inherent biases.

Contribution

First comprehensive multilingual evaluation of LLMs for offensive language detection, analyzing prompt language effects and dataset biases.

Findings

01

LLMs perform variably across languages and settings.

02

Prompt language influences detection accuracy.

03

Biases in models and datasets affect mispredictions.

Abstract

Identifying offensive language is essential for maintaining safety and sustainability in the social media era. Though large language models (LLMs) have demonstrated encouraging potential in social media analytics, they lack thorough evaluation when in offensive language detection, particularly in multilingual environments. We for the first time evaluate multilingual offensive language detection of LLMs in three languages: English, Spanish, and German with three LLMs, GPT-3.5, Flan-T5, and Mistral, in both monolingual and multilingual settings. We further examine the impact of different prompt languages and augmented translation data for the task in non-English contexts. Furthermore, we discuss the impact of the inherent bias in LLMs and the datasets in the mispredictions related to sensitive topics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Cosine Annealing · Attention Dropout · Softmax · Multi-Head Attention · {Dispute@FaQ-s}How to file a dispute with Expedia? · Linear Warmup With Cosine Annealing · Adam