Emirati-Accented Speaker Identification in Stressful Talking Conditions

Ismail Shahin; Ali Bou Nassif

arXiv:1909.13070·cs.SD·October 30, 2019

Emirati-Accented Speaker Identification in Stressful Talking Conditions

Ismail Shahin, Ali Bou Nassif

PDF

TL;DR

This study enhances Emirati-accented speaker identification in stressful conditions by comparing different Hidden Markov Models, achieving accuracy comparable to human listeners.

Contribution

It introduces a comparative analysis of HMM1s, HMM2s, and HMM3s for stress-resilient Emirati speaker identification using a new database.

Findings

01

HMM3s achieved 65.0% accuracy in stressful conditions.

02

HMM-based methods perform comparably to human listeners.

03

Stressful talking conditions reduce identification accuracy.

Abstract

This research is dedicated to improving text-independent Emirati-accented speaker identification performance in stressful talking conditions using three distinct classifiers: First-Order Hidden Markov Models (HMM1s), Second-Order Hidden Markov Models (HMM2s), and Third-Order Hidden Markov Models (HMM3s). The database that has been used in this work was collected from 25 per gender Emirati native speakers uttering eight widespread Emirati sentences in each of neutral, shouted, slow, loud, soft, and fast talking conditions. The extracted features of the captured database are called Mel-Frequency Cepstral Coefficients (MFCCs). Based on HMM1s, HMM2s, and HMM3s, average Emirati-accented speaker identification accuracy in stressful conditions is 58.6%, 61.1%, and 65.0%, respectively. The achieved average speaker identification accuracy in stressful conditions based on HMM3s is so similar to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.