Improving Text-Independent Speaker Verification with Auxiliary Speakers   Using Graph

Jingyu Li; Si-Ioi Ng; Tan Lee

arXiv:2109.09674·eess.AS·September 21, 2021

Improving Text-Independent Speaker Verification with Auxiliary Speakers Using Graph

Jingyu Li, Si-Ioi Ng, Tan Lee

PDF

Open Access

TL;DR

This paper introduces a graph-based method that uses auxiliary speaker embeddings to refine similarity scores in text-independent speaker verification, improving robustness without requiring extra data during verification.

Contribution

It proposes a novel graph model that incorporates auxiliary speaker embeddings to enhance similarity score refinement in speaker verification systems.

Findings

01

Improved verification accuracy on Voxceleb datasets

02

Effective use of artificial embeddings as auxiliary speakers

03

End-to-end training of the graph-based verification model

Abstract

The paper presents a novel approach to refining similarity scores between input utterances for robust speaker verification. Given the embeddings from a pair of input utterances, a graph model is designed to incorporate additional information from a group of embeddings representing the so-called auxiliary speakers. The relations between the input utterances and the auxiliary speakers are represented by the edges and vertices in the graph. The similarity scores are refined by iteratively updating the values of the graph's vertices using an algorithm similar to the random walk algorithm on graphs. Through this updating process, the information of auxiliary speakers is involved in determining the relation between input utterances and hence contributing to the verification process. We propose to create a set of artificial embeddings through the model training process. Utilizing the generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling