Speaker Identification using MFCC-Domain Support Vector Machine
S. M. Kamruzzaman, A. N. M. Rezaul Karim, Md. Saiful Islam, and Md., Emdadul Haque

TL;DR
This paper proposes a text-dependent speaker identification method using MFCC features and an SVM trained with SMO, demonstrating improved performance and convergence speed through extensive experiments.
Contribution
It introduces a novel combination of MFCC features with SMO-trained SVMs for speaker identification, enhancing accuracy and efficiency.
Findings
Improved speaker identification accuracy with MFCC-SVM approach
Faster convergence of SVM training using SMO technique
Effective differentiation of speakers based on cepstrum features
Abstract
Speech recognition and speaker identification are important for authentication and verification in security purpose, but they are difficult to achieve. Speaker identification methods can be divided into text-independent and text-dependent. This paper presents a technique of text-dependent speaker identification using MFCC-domain support vector machine (SVM). In this work, melfrequency cepstrum coefficients (MFCCs) and their statistical distribution properties are used as features, which will be inputs to the neural network. This work firstly used sequential minimum optimization (SMO) learning technique for SVM that improve performance over traditional techniques Chunking, Osuna. The cepstrum coefficients representing the speaker characteristics of a speech segment are computed by nonlinear filter bank analysis and discrete cosine transform. The speaker identification ability and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques
