Towards hate speech detection in low-resource languages: Comparing ASR   to acoustic word embeddings on Wolof and Swahili

Christiaan Jacobs; Nathana\"el Carraz Rakotonirina; Everlyn Asiko; Chimoto; Bruce A. Bassett; Herman Kamper

arXiv:2306.00410·cs.CL·June 2, 2023·1 cites

Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili

Christiaan Jacobs, Nathana\"el Carraz Rakotonirina, Everlyn Asiko, Chimoto, Bruce A. Bassett, Herman Kamper

PDF

Open Access

TL;DR

This paper compares automatic speech recognition and acoustic word embeddings for hate speech keyword detection in low-resource languages, finding AWE more robust in real-world scenarios with minimal training data.

Contribution

It demonstrates that acoustic word embeddings can outperform traditional ASR in low-resource, real-world hate speech detection tasks with limited labeled data.

Findings

01

AWE performs better in in-the-wild scenarios with minimal data

02

ASR outperforms AWE in controlled, same-domain experiments

03

AWE requires significantly less training data to achieve comparable performance

Abstract

We consider hate speech detection through keyword spotting on radio broadcasts. One approach is to build an automatic speech recognition (ASR) system for the target low-resource language. We compare this to using acoustic word embedding (AWE) models that map speech segments to a space where matching words have similar vectors. We specifically use a multilingual AWE model trained on labelled data from well-resourced languages to spot keywords in data in the unseen target language. In contrast to ASR, the AWE approach only requires a few keyword exemplars. In controlled experiments on Wolof and Swahili where training and test data are from the same domain, an ASR model trained on just five minutes of data outperforms the AWE approach. But in an in-the-wild test on Swahili radio broadcasts with actual hate speech keywords, the AWE model (using one minute of template data) is more robust,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Speech and Audio Processing · Speech Recognition and Synthesis

MethodsTest