i Vector used in Speaker Identification by Dimension Compactness

Soumen Kanrar

arXiv:1704.03934·cs.SD·April 14, 2017·1 cites

i Vector used in Speaker Identification by Dimension Compactness

Soumen Kanrar

PDF

Open Access

TL;DR

This paper introduces a method for efficient feature extraction in speaker identification by utilizing vector dimension compactness in total variability space and cosine distance scoring for quick predictions on small utterances.

Contribution

It proposes a novel implementation of dimension compactness in total variability space combined with cosine distance scoring for improved speaker identification efficiency.

Findings

01

Enhanced speed in speaker prediction for small utterances

02

Effective feature representation using dimension compactness

03

Improved accuracy in acoustic signal classification

Abstract

The automatic speaker identification procedure is used to extract features that help to identify the components of the acoustic signal by discarding all the other stuff like background noise, emotion, hesitation, etc. The acoustic signal is generated by a human that is filtered by the shape of the vocal tract, including tongue, teeth, etc. The shape of the vocal tract determines and produced, what signal comes out in real time. The analytically develops shape of the vocal tract, which exhibits envelop for the short time power spectrum. The ASR needs efficient way of extracting features from the acoustic signal that is used effectively to makes the shape of the individual vocal tract. To identify any acoustic signal in the large collection of acoustic signal i.e. corpora, it needs dimension compactness of total variability space by using the GMM mean super vector. This work presents the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques