SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia
Ri Chi Ng, Aditi Kumaresan, Yujia Hu, Roy Ka-Wei Lee

TL;DR
SEAHateCheck introduces a culturally relevant dataset and testing framework for hate speech detection in low-resource Southeast Asian languages, revealing model limitations and guiding improvements for inclusive online moderation.
Contribution
This work presents the first functional test suite for hate speech detection in Indonesian, Thai, Tagalog, and Vietnamese, enhancing cultural relevance and diagnostic capabilities.
Findings
Models perform poorly on Tagalog and slang-based test cases.
Diagnostic insights reveal weaknesses in implicit hate detection.
Models struggle with culturally nuanced expressions.
Abstract
Hate speech detection relies heavily on linguistic resources, which are primarily available in high-resource languages such as English and Chinese. This creates barriers for researchers and platforms developing tools for low-resource languages in Southeast Asia, where diverse socio-linguistic contexts complicate online hate moderation. To address this, we introduce SEAHateCheck, a pioneering dataset tailored to Indonesia, Thailand, the Philippines, and Vietnam, covering Indonesian, Tagalog, Thai, and Vietnamese. Building on HateCheck's functional testing framework and refining SGHateCheck's methods, SEAHateCheck provides culturally relevant test cases, augmented by large language models and validated by local experts for accuracy. Experiments with state-of-the-art and multilingual models revealed limitations in detecting hate speech in specific low-resource languages. In particular, Tagalog…
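As a rough illustration of what functional testing means here, the sketch below scores a classifier separately for each (language, functionality) pair rather than reporting a single overall accuracy. It is not the authors' evaluation code. The test cases, the functionality names, the labels, and the `toy_classifier` stand-in are all hypothetical placeholders and are not drawn from SEAHateCheck.

```python
from collections import defaultdict

# Hypothetical test cases: (language, functionality, text, gold_label).
# Texts are placeholders only; none come from SEAHateCheck.
TEST_CASES = [
    ("tagalog", "slang", "placeholder text 1", "hateful"),
    ("tagalog", "slang", "placeholder text 2", "hateful"),
    ("tagalog", "negation", "placeholder text 3", "non-hateful"),
    ("thai", "slang", "placeholder text 4", "hateful"),
]


def toy_classifier(text):
    """Stand-in for a real model: labels every input as non-hateful."""
    return "non-hateful"


def accuracy_by_group(cases, predict):
    """Return accuracy for each (language, functionality) pair."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for language, functionality, text, gold in cases:
        key = (language, functionality)
        total[key] += 1
        if predict(text) == gold:
            correct[key] += 1
    return {key: correct[key] / total[key] for key in total}


report = accuracy_by_group(TEST_CASES, toy_classifier)
for (language, functionality), acc in sorted(report.items()):
    print(f"{language:10s} {functionality:10s} accuracy={acc:.2f}")
```

Breaking the score down this way is what lets a test suite point to a specific weakness, such as slang in one language, that an aggregate number would hide.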
Taxonomy
Topics: Hate Speech and Cyberbullying Detection · Sentiment Analysis and Opinion Mining · Spam and Phishing Detection
