Wespeaker baselines for VoxSRC2023
Shuai Wang, Chengdong Liang, Xu Xiang, Bing Han, Zhengyang Chen,, Hongji Wang, Wen Ding

TL;DR
This paper presents baseline results using the wespeaker toolkit for VoxSRC2023, offering accessible guidelines and strong initial systems for participants, especially newcomers, to develop speaker recognition models.
Contribution
It provides well-structured recipes and baseline results to facilitate entry-level development for VoxSRC2023 using the wespeaker toolkit.
Findings
Achieved strong baseline results on VoxSRC2023 dev set.
Provided clear guidelines for developing speaker recognition systems.
Supported newcomers with accessible starting points.
Abstract
This report showcases the results achieved using the wespeaker toolkit for the VoxSRC2023 Challenge. Our aim is to provide participants, especially those with limited experience, with clear and straightforward guidelines to develop their initial systems. Via well-structured recipes and strong results, we hope to offer an accessible and good enough start point for all interested individuals. In this report, we describe the results achieved on the VoxSRC2023 dev set using the pretrained models, you can check the CodaLab evaluation server for the results on the evaluation set.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems
