i Vector used in Speaker Identification by Dimension Compactness
Soumen Kanrar

TL;DR
This paper introduces a method for efficient feature extraction in speaker identification by utilizing vector dimension compactness in total variability space and cosine distance scoring for quick predictions on small utterances.
Contribution
It proposes a novel implementation of dimension compactness in total variability space combined with cosine distance scoring for improved speaker identification efficiency.
Findings
Enhanced speed in speaker prediction for small utterances
Effective feature representation using dimension compactness
Improved accuracy in acoustic signal classification
Abstract
The automatic speaker identification procedure is used to extract features that help to identify the components of the acoustic signal by discarding all the other stuff like background noise, emotion, hesitation, etc. The acoustic signal is generated by a human that is filtered by the shape of the vocal tract, including tongue, teeth, etc. The shape of the vocal tract determines and produced, what signal comes out in real time. The analytically develops shape of the vocal tract, which exhibits envelop for the short time power spectrum. The ASR needs efficient way of extracting features from the acoustic signal that is used effectively to makes the shape of the individual vocal tract. To identify any acoustic signal in the large collection of acoustic signal i.e. corpora, it needs dimension compactness of total variability space by using the GMM mean super vector. This work presents the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques
