Jira: a Kurdish Speech Recognition System Designing and Building Speech Corpus and Pronunciation Lexicon

Hadi Veisi; Hawre Hosseini; Mohammad Mohammadamini (LIA); Wirya Fathy; Aso Mahmudi

arXiv:2102.07412 · cs.AI · February 16, 2021 · 5 cites

Open Access

TL;DR

This paper presents the development of Jira, the first large vocabulary speech recognition system for Central Kurdish, including a speech corpus, pronunciation lexicon, and acoustic models trained with various methods.

Contribution

It introduces the first Kurdish speech corpus, pronunciation lexicon, and speech recognition system, addressing resource scarcity for the language.

Findings

01 Best model achieved a 13.9% word error rate

02 Created 43.68 hours of speech data from 576 speakers

03 Developed a 60K pronunciation lexicon
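
For readers unfamiliar with the metric in the first finding, word error rate is conventionally the word-level edit distance between the reference transcript and the recognizer output, divided by the number of reference words. The sketch below is a minimal illustration of that standard definition. The function name `word_error_rate` and the whitespace tokenization are assumptions made here; this page does not say which scoring tool or text normalization the authors used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative only: tokenization is plain whitespace splitting, which is
    an assumption and may differ from the paper's scoring setup.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.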

Abstract

In this paper, we introduce the first large vocabulary speech recognition (LVSR) system for the Central Kurdish language, named Jira. Kurdish is an Indo-European language spoken by more than 30 million people in several countries, but due to the lack of speech and text resources, there is no speech recognition system for this language. To fill this gap, we introduce the first speech corpus and pronunciation lexicon for the Kurdish language. For the speech corpus, we designed a sentence collection in which the ratio of di-phones resembles that of real Central Kurdish data. The designed sentences were uttered by 576 speakers in a controlled environment with noise-free microphones (called AsoSoft Speech-Office) and in the Telegram social network environment using mobile phones (denoted as AsoSoft Speech-Crowdsourcing), resulting in 43.68 hours of…
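
The abstract's di-phone criterion can be illustrated with a small sketch: count adjacent phone pairs in a candidate sentence set and in a larger reference sample, normalize both to relative frequencies, and measure how far apart the two distributions are. Everything here is an assumption for illustration: the helper names `diphone_distribution` and `total_variation`, the representation of each utterance as a list of phone symbols, and the choice of total variation distance as the comparison. The abstract does not state how the authors measured or optimized the match.

```python
from collections import Counter


def diphone_distribution(utterances):
    """Relative frequency of each adjacent phone pair.

    `utterances` is a list of phone sequences (lists of strings); this input
    format is a hypothetical choice for the sketch.
    """
    counts = Counter()
    for phones in utterances:
        counts.update(zip(phones, phones[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


def total_variation(p, q):
    """Half the L1 distance between two distributions (0 = same, 1 = disjoint)."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)
```

Under these assumptions, a sentence set whose distance to the reference distribution is close to 0 would be one whose di-phone ratios resemble the reference data.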

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing