The VoxCeleb Speaker Recognition Challenge: A Retrospective
Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon, Jung, Daniel Garcia-Romero, Andrew Zisserman

TL;DR
This paper reviews the VoxCeleb Speaker Recognition Challenges from 2019 to 2023, highlighting progress, methods, and challenges in speaker verification and diarisation, and providing insights for researchers and organizers.
Contribution
It offers a comprehensive retrospective analysis of the VoxSRC challenges, detailing methodological evolution, performance trends, and insights into current and future challenges in speaker recognition.
Findings
Performance improved over the five challenge editions.
Special focus areas influenced participant strategies and results.
The field faces ongoing challenges in domain adaptation and open-set recognition.
Abstract
The VoxCeleb Speaker Recognition Challenges (VoxSRC) were a series of challenges and workshops that ran annually from 2019 to 2023. The challenges primarily evaluated the tasks of speaker recognition and diarisation under various settings including: closed and open training data; as well as supervised, self-supervised, and semi-supervised training for domain adaptation. The challenges also provided publicly available training and evaluation datasets for each task and setting, with new test sets released each year. In this paper, we provide a review of these challenges that covers: what they explored; the methods developed by the challenge participants and how these evolved; and also the current state of the field for speaker verification and diarisation. We chart the progress in performance over the five installments of the challenge on a common evaluation dataset and provide a detailed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
