Generative x-vectors for text-independent speaker verification
Longting Xu, Rohan Kumar Das, Emre Y{\i}lmaz, Jichen Yang, Haizhou Li

TL;DR
This paper introduces generative x-vectors, a novel approach combining i-vector and x-vector information via a transformation model, significantly improving speaker verification performance especially for long-duration utterances.
Contribution
The paper proposes a new generative x-vector method using canonical correlation analysis to enhance speaker verification by integrating i-vector and x-vector features.
Findings
Generative x-vectors outperform baseline i-vector and x-vector systems.
The method improves performance on long-duration utterances.
Comparable results to fusion systems for short-duration utterances.
Abstract
Speaker verification (SV) systems using deep neural network embeddings, so-called the x-vector systems, are becoming popular due to its good performance superior to the i-vector systems. The fusion of these systems provides improved performance benefiting both from the discriminatively trained x-vectors and generative i-vectors capturing distinct speaker characteristics. In this paper, we propose a novel method to include the complementary information of i-vector and x-vector, that is called generative x-vector. The generative x-vector utilizes a transformation model learned from the i-vector and x-vector representations of the background data. Canonical correlation analysis is applied to derive this transformation model, which is later used to transform the standard x-vectors of the enrollment and test segments to the corresponding generative x-vectors. The SV experiments performed on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques
