Speaker Verification Using Simple Temporal Features and Pitch   Synchronous Cepstral Coefficients

Bhavana V. S; Pradip K. Das

arXiv:1908.05553·cs.SD·August 16, 2019·1 cites

Speaker Verification Using Simple Temporal Features and Pitch Synchronous Cepstral Coefficients

Bhavana V. S, Pradip K. Das

PDF

Open Access

TL;DR

This paper explores a speaker verification method using simple temporal features and pitch synchronous cepstral coefficients, achieving promising accuracy by combining these features.

Contribution

It introduces a novel feature set combining intra-pitch temporal information with pitch synchronous cepstral coefficients for speaker verification.

Findings

01

Achieved 91.04% accuracy on a 20-speaker database.

02

Combining features improved verification performance.

03

Analysis of misclassified speakers provided insights.

Abstract

Speaker verification is the process by which a speakers claim of identity is tested against a claimed speaker by his or her voice. Speaker verification is done by the use of some parameters (features) from the speakers voice which can be used to differentiate among many speakers. The efficiency of speaker verification system mainly depends on the feature set providing high inter-speaker variability and low intra-speaker variability. There are many methods used for speaker verification. Some systems use Mel Frequency Cepstral Coefficients as features (MFCCs), while others use Hidden Markov Models (HMM) based speaker recognition, Support Vector Machines (SVM), GMMs . In this paper simple intra-pitch temporal information in conjunction with pitch synchronous cepstral coefficients forms the feature set. The distinct feature of a speaker is determined from the steady state part of five…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing