A Study of Acoustic Features in Arabic Speaker Identification under   Noisy Environmental Conditions

Zhor Benhafid; Kawthar Yasmine Zergat; Abderrahmane Amrouche

arXiv:2110.12304·eess.AS·October 26, 2021

A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions

Zhor Benhafid, Kawthar Yasmine Zergat, Abderrahmane Amrouche

PDF

Open Access

TL;DR

This study evaluates the robustness of various acoustic features for Arabic speaker identification in noisy environments, finding GFCC and PNCC outperform traditional MFCC features under different noise conditions.

Contribution

It compares the effectiveness of multiple acoustic features in noisy environments for Arabic speaker identification, highlighting the superior performance of GFCC and PNCC.

Findings

01

GFCC and PNCC outperform MFCC in noisy conditions

02

Robust features improve speaker identification accuracy in noise

03

Performance varies with different noise types and SNR levels

Abstract

One of the major parts of the voice recognition field is the choice of acoustic features which have to be robust against the variability of the speech signal, mismatched conditions, and noisy environments. Thus, different speech feature extraction techniques have been developed. In this paper, we investigate the robustness of several front-end techniques in Arabic speaker identification. We evaluate five different features in babble, factory and subway conditions at the various signal to noise ratios (SNR). The obtained results showed that two of the auditory feature i.e. gammatone frequency cepstral coefficient (GFCC) and power normalization cepstral coefficients (PNCC), unlike their combination performs substantially better than a conventional speaker features i.e. Mel-frequency cepstral coefficients (MFCC).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing