AfroDigits: A Community-Driven Spoken Digit Dataset for African   Languages

Chris Chinenye Emezue; Sanchit Gandhi; Lewis Tunstall; Abubakar Abid,; Josh Meyer; Quentin Lhoest; Pete Allen; Patrick Von Platen; Douwe Kiela,; Yacine Jernite; Julien Chaumond; Merve Noyan; Omar Sanseviero

arXiv:2303.12582·cs.CL·April 5, 2023·1 cites

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

Chris Chinenye Emezue, Sanchit Gandhi, Lewis Tunstall, Abubakar Abid,, Josh Meyer, Quentin Lhoest, Pete Allen, Patrick Von Platen, Douwe Kiela,, Yacine Jernite, Julien Chaumond, Merve Noyan, Omar Sanseviero

PDF

Open Access

TL;DR

AfroDigits is the first publicly available spoken digit dataset for 38 African languages, enabling speech technology development for African languages and demonstrating its utility through classification experiments with modern models.

Contribution

This paper introduces AfroDigits, a novel community-driven speech dataset for African languages, and demonstrates its application in digit classification tasks using advanced speech models.

Findings

01

Mixing African speech corpora during finetuning improves model performance.

02

AfroDigits enables development of Afro-centric speech applications.

03

The dataset covers 38 African languages and is publicly available.

Abstract

The advancement of speech technologies has been remarkable, yet its integration with African languages remains limited due to the scarcity of African speech corpora. To address this issue, we present AfroDigits, a minimalist, community-driven dataset of spoken digits for African languages, currently covering 38 African languages. As a demonstration of the practical applications of AfroDigits, we conduct audio digit classification experiments on six African languages [Igbo (ibo), Yoruba (yor), Rundi (run), Oshiwambo (kua), Shona (sna), and Oromo (gax)] using the Wav2Vec2.0-Large and XLS-R models. Our experiments reveal a useful insight on the effect of mixing African speech corpora during finetuning. AfroDigits is the first published audio digit dataset for African languages and we believe it will, among other things, pave the way for Afro-centric speech applications such as the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Speech and dialogue systems