Spoken Language Identification Using Hybrid Feature Extraction Methods

Pawan Kumar; Astik Biswas; A .N. Mishra; Mahesh Chandra

arXiv:1003.5623·cs.SD·March 31, 2010·33 cites

Spoken Language Identification Using Hybrid Feature Extraction Methods

Pawan Kumar, Astik Biswas, A .N. Mishra, Mahesh Chandra

PDF

Open Access

TL;DR

This study explores hybrid feature extraction methods combining MFCC, PLP, BFCC, and RPLP for spoken language identification, demonstrating improved accuracy over traditional features using VQ-DTW and GMM classifiers.

Contribution

It introduces hybrid feature extraction techniques for LID and compares their effectiveness with conventional methods, highlighting BFCC and RPLP as superior features.

Findings

01

Hybrid features outperform conventional features in language identification accuracy.

02

BFCC shows better performance than MFCC across classifiers.

03

RPLP combined with GMM yields the highest identification performance.

Abstract

This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) system. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) along with two hybrid features are used for language Identification. Two hybrid features, Bark Frequency Cepstral Coefficients (BFCC) and Revised Perceptual Linear Prediction Coefficients (RPLP) were obtained from combination of MFCC and PLP. Two different classifiers, Vector Quantization (VQ) with Dynamic Time Warping (DTW) and Gaussian Mixture Model (GMM) were used for classification. The experiment shows better identification rate using hybrid feature extraction techniques compared to conventional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing