Speaker Identification using MFCC-Domain Support Vector Machine

S. M. Kamruzzaman; A. N. M. Rezaul Karim; Md. Saiful Islam; and Md. Emdadul Haque

arXiv:1009.4972·cs.LG·September 28, 2010·19 cites

Open Access

TL;DR

This paper proposes a text-dependent speaker identification method using MFCC features and an SVM trained with SMO, demonstrating improved performance and convergence speed through extensive experiments.

Contribution

It introduces a novel combination of MFCC features with SMO-trained SVMs for speaker identification, enhancing accuracy and efficiency.

Findings

01 Improved speaker identification accuracy with MFCC-SVM approach

02 Faster convergence of SVM training using SMO technique

03 Effective differentiation of speakers based on cepstrum features

Abstract

Speech recognition and speaker identification are important for authentication and verification in security applications, but they are difficult to achieve. Speaker identification methods can be divided into text-independent and text-dependent. This paper presents a technique for text-dependent speaker identification using an MFCC-domain support vector machine (SVM). In this work, mel-frequency cepstrum coefficients (MFCCs) and their statistical distribution properties are used as features, which will be inputs to the neural network. This work first used the sequential minimal optimization (SMO) learning technique for the SVM, which improves performance over the traditional chunking and Osuna techniques. The cepstrum coefficients representing the speaker characteristics of a speech segment are computed by nonlinear filter bank analysis and discrete cosine transform. The speaker identification ability and…
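The abstract describes the cepstrum coefficients as the result of a nonlinear filter bank analysis followed by a discrete cosine transform. A minimal sketch of that pipeline for a single speech frame might look like the following. It is not the authors' implementation: the Hamming window, the common mel-scale formula, the 20 triangular filters, the 12 output coefficients, and all function names are assumptions chosen for illustration.

```python
import math

def hz_to_mel(f):
    # Common mel-scale formula (an assumption; the abstract does not state one).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of a Hamming-windowed frame via a naive DFT."""
    n = len(frame)
    w = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
         for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(w))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(w))
        spec.append((re * re + im * im) / n)
    return spec

def mel_filterbank(num_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the (nonlinear) mel scale."""
    low, high = hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0)
    mels = [low + (high - low) * i / (num_filters + 1)
            for i in range(num_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate)) for m in mels]
    bank = []
    for j in range(1, num_filters + 1):
        left, center, right = bins[j - 1], bins[j], bins[j + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(left, center):       # rising edge of the triangle
            row[k] = (k - left) / (center - left)
        for k in range(center, right):      # peak and falling edge
            row[k] = (right - k) / (right - center)
        bank.append(row)
    return bank

def mfcc(frame, sample_rate, num_filters=20, num_coeffs=12):
    """Log mel filter bank energies followed by a DCT-II (c0 is dropped)."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(num_filters, len(frame), sample_rate)
    log_e = [math.log(max(sum(w * p for w, p in zip(row, spec)), 1e-10))
             for row in bank]
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / num_filters)
                for j, e in enumerate(log_e))
            for c in range(1, num_coeffs + 1)]

# Usage: one 256-sample frame of a 440 Hz tone sampled at 8 kHz.
frame = [math.sin(2 * math.pi * 440 * i / 8000) for i in range(256)]
coeffs = mfcc(frame, 8000)
print(len(coeffs))
```

In a full system, per-frame vectors like `coeffs` (and statistics computed over them) would form the feature inputs to the classifier; the naive DFT here is for readability only, and an FFT would normally replace it.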

Taxonomy

Topics: Speech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques