Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Christiaan Jacobs, Nathana\"el Carraz Rakotonirina, Everlyn Asiko, Chimoto, Bruce A. Bassett, Herman Kamper

TL;DR
This paper compares automatic speech recognition and acoustic word embeddings for hate speech keyword detection in low-resource languages, finding AWE more robust in real-world scenarios with minimal training data.
Contribution
It demonstrates that acoustic word embeddings can outperform traditional ASR in low-resource, real-world hate speech detection tasks with limited labeled data.
Findings
AWE performs better in in-the-wild scenarios with minimal data
ASR outperforms AWE in controlled, same-domain experiments
AWE requires significantly less training data to achieve comparable performance
Abstract
We consider hate speech detection through keyword spotting on radio broadcasts. One approach is to build an automatic speech recognition (ASR) system for the target low-resource language. We compare this to using acoustic word embedding (AWE) models that map speech segments to a space where matching words have similar vectors. We specifically use a multilingual AWE model trained on labelled data from well-resourced languages to spot keywords in data in the unseen target language. In contrast to ASR, the AWE approach only requires a few keyword exemplars. In controlled experiments on Wolof and Swahili where training and test data are from the same domain, an ASR model trained on just five minutes of data outperforms the AWE approach. But in an in-the-wild test on Swahili radio broadcasts with actual hate speech keywords, the AWE model (using one minute of template data) is more robust,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Speech and Audio Processing · Speech Recognition and Synthesis
MethodsTest
