The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022
Jingguang Tian, Xinhui Hu, Xinkang Xu

TL;DR
This paper presents the Royalflush system for VoxCeleb Speaker Recognition Challenge 2022, featuring a U-Net-based extractor for supervised verification and a semi-supervised approach with domain adaptation, achieving state-of-the-art results.
Contribution
Introduces a novel U-Net-based speaker embedding extractor and a semi-supervised domain adaptation method for speaker verification.
Findings
U-Net-based extractor outperforms ECAPA-TDNN with 20.7% relative EER reduction.
Semi-supervised domain adaptation improves target domain performance.
Fusion of multiple models yields top performance on validation set.
Abstract
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22). Our submissions contain track 1, which is for supervised speaker verification and track 3, which is for semi-supervised speaker verification. For track 1, we develop a powerful U-Net-based speaker embedding extractor with a symmetric architecture. The proposed system achieves 2.06% in EER and 0.1293 in MinDCF on the validation set. Compared with the state-of-the-art ECAPA-TDNN, it obtains a relative improvement of 20.7% in EER and 22.70% in MinDCF. For track 3, we employ the joint training of source domain supervision and target domain self-supervision to get a speaker embedding extractor. The subsequent clustering process can obtain target domain pseudo-speaker labels. We adapt the speaker embedding extractor using all source and target domain data in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques
