The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic   Speech Recognition

Jonathan Mukiibi; Andrew Katumba; Joyce Nakatumba-Nabende; Ali; Hussein; Josh Meyer

arXiv:2206.09790·cs.CL·June 22, 2022·5 cites

The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition

Jonathan Mukiibi, Andrew Katumba, Joyce Nakatumba-Nabende, Ali, Hussein, Josh Meyer

PDF

Open Access

TL;DR

This paper introduces the Makerere Radio Speech Corpus, a 155-hour Luganda radio dataset, enabling development of automatic speech recognition systems for under-resourced languages in Africa.

Contribution

It presents the first publicly available Luganda radio speech dataset and baseline ASR performance results using open source tools.

Findings

01

First publicly available Luganda radio dataset

02

Baseline ASR performance established with Coqui STT

03

Supports development of ASR for under-resourced languages

Abstract

Building a usable radio monitoring automatic speech recognition (ASR) system is a challenging task for under-resourced languages and yet this is paramount in societies where radio is the main medium of public communication and discussions. Initial efforts by the United Nations in Uganda have proved how understanding the perceptions of rural people who are excluded from social media is important in national planning. However, these efforts are being challenged by the absence of transcribed speech datasets. In this paper, The Makerere Artificial Intelligence research lab releases a Luganda radio speech corpus of 155 hours. To our knowledge, this is the first publicly available radio dataset in sub-Saharan Africa. The paper describes the development of the voice corpus and presents baseline Luganda ASR performance results using Coqui STT toolkit, an open source speech recognition toolkit.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Speech and dialogue systems