Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification
Zhengyang Chen, Shuai Wang, Yanmin Qian

TL;DR
This paper introduces a self-supervised learning based domain adaptation method for speaker verification that leverages unlabeled target data to improve performance, achieving state-of-the-art results without using target labels.
Contribution
The paper proposes a novel SSDA approach that combines self-supervised learning with unsupervised domain adaptation to enhance speaker verification across domains.
Findings
Achieved 10.2% EER on CnCeleb without target labels.
Outperformed traditional UDA methods on the same datasets.
State-of-the-art results on CnCeleb dataset.
Abstract
Large performance degradation is often observed for speaker ver-ification systems when applied to a new domain dataset. Givenan unlabeled target-domain dataset, unsupervised domain adaptation(UDA) methods, which usually leverage adversarial training strate-gies, are commonly used to bridge the performance gap caused bythe domain mismatch. However, such adversarial training strategyonly uses the distribution information of target domain data and cannot ensure the performance improvement on the target domain. Inthis paper, we incorporate self-supervised learning strategy to the un-supervised domain adaptation system and proposed a self-supervisedlearning based domain adaptation approach (SSDA). Compared tothe traditional UDA method, the new SSDA training strategy canfully leverage the potential label information from target domainand adapt the speaker discrimination ability from source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
