Jira: a Kurdish Speech Recognition System Designing and Building Speech Corpus and Pronunciation Lexicon
Hadi Veisi, Hawre Hosseini, Mohammad Mohammadamini (LIA), Wirya Fathy, Aso Mahmudi

TL;DR
This paper presents the development of Jira, the first large vocabulary speech recognition system for Central Kurdish, including a speech corpus, pronunciation lexicon, and acoustic models trained with various methods.
Contribution
It introduces the first Kurdish speech corpus, pronunciation lexicon, and speech recognition system, addressing resource scarcity for the language.
Findings
Best model achieved 13.9% word error rate
Created 43.68 hours of speech data from 576 speakers
Developed a 60K pronunciation lexicon
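For readers unfamiliar with the metric, word error rate is conventionally the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the recognizer's hypothesis, divided by the number of reference words. The sketch below is a minimal illustration of that standard definition, not the authors' evaluation code; the function name `word_error_rate` and the whitespace tokenization are assumptions made for this example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace. A real evaluation
    would normally apply text normalization first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

Under this definition, a 13.9% word error rate corresponds to roughly 14 word-level errors per 100 reference words.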
Abstract
In this paper, we introduce Jira, the first large vocabulary speech recognition (LVSR) system for the Central Kurdish language. Kurdish is an Indo-European language spoken by more than 30 million people in several countries, but due to the lack of speech and text resources, there is no speech recognition system for this language. To fill this gap, we introduce the first speech corpus and pronunciation lexicon for the Kurdish language. For the speech corpus, we designed a sentence collection in which the ratio of di-phones resembles that of real Central Kurdish data. The designed sentences were uttered by 576 speakers in a controlled environment with noise-free microphones (called AsoSoft Speech-Office) and in the Telegram social network environment using mobile phones (denoted as AsoSoft Speech-Crowdsourcing), resulting in 43.68 hours of…
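The abstract's idea of matching di-phone ratios between the designed sentences and real language data can be illustrated roughly as follows. This is a sketch under stated assumptions, not the authors' procedure: a di-phone is approximated here as a pair of adjacent phonemes, the helper names `diphone_distribution` and `total_variation` are hypothetical, and total variation distance is one possible way to compare the two distributions that this summary does not attribute to the paper.

```python
from collections import Counter

def diphone_distribution(phoneme_sequences):
    """Relative frequency of each adjacent phoneme pair.

    Assumption: each item is one sentence, already converted to a
    list of phoneme symbols.
    """
    counts = Counter()
    for seq in phoneme_sequences:
        counts.update(zip(seq, seq[1:]))  # adjacent pairs within a sentence
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

def total_variation(p, q):
    """Total variation distance between two discrete distributions.

    0.0 means identical di-phone ratios; 1.0 means no overlap.
    """
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)
```

A candidate sentence set could then be scored by computing `total_variation` between its di-phone distribution and that of a large reference text, with lower values indicating a closer match.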
Taxonomy
Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
