The VoxCeleb Speaker Recognition Challenge: A Retrospective

Jaesung Huh; Joon Son Chung; Arsha Nagrani; Andrew Brown; Jee-weon; Jung; Daniel Garcia-Romero; Andrew Zisserman

arXiv:2408.14886·cs.SD·August 28, 2024

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon, Jung, Daniel Garcia-Romero, Andrew Zisserman

PDF

TL;DR

This paper reviews the VoxCeleb Speaker Recognition Challenges from 2019 to 2023, highlighting progress, methods, and challenges in speaker verification and diarisation, and providing insights for researchers and organizers.

Contribution

It offers a comprehensive retrospective analysis of the VoxSRC challenges, detailing methodological evolution, performance trends, and insights into current and future challenges in speaker recognition.

Findings

01

Performance improved over the five challenge editions.

02

Special focus areas influenced participant strategies and results.

03

The field faces ongoing challenges in domain adaptation and open-set recognition.

Abstract

The VoxCeleb Speaker Recognition Challenges (VoxSRC) were a series of challenges and workshops that ran annually from 2019 to 2023. The challenges primarily evaluated the tasks of speaker recognition and diarisation under various settings including: closed and open training data; as well as supervised, self-supervised, and semi-supervised training for domain adaptation. The challenges also provided publicly available training and evaluation datasets for each task and setting, with new test sets released each year. In this paper, we provide a review of these challenges that covers: what they explored; the methods developed by the challenge participants and how these evolved; and also the current state of the field for speaker verification and diarisation. We chart the progress in performance over the five installments of the challenge on a common evaluation dataset and provide a detailed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus