TL;DR
DIHARD III is a comprehensive challenge that assesses speaker diarization systems across diverse, real-world audio conditions, highlighting significant progress yet persistent challenges in the field.
Contribution
This paper presents the third DIHARD diarization challenge, introducing evaluation across new domains like conversational telephone speech and providing a large-scale benchmark for system robustness.
Findings
Marked improvement in diarization accuracy since DIHARD I.
Significant progress for two-party interactions.
Challenges remain in domains like web videos.
Abstract
DIHARD III was the third in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variability in recording equipment, noise conditions, and conversational domain. Speaker diarization was evaluated under two speech activity conditions (diarization from a reference speech activity vs. diarization from scratch) and 11 diverse domains. The domains span a range of recording conditions and interaction types, including read audio-books, meeting speech, clinical interviews, web videos, and, for the first time, conversational telephone speech. A total of 30 organizations (forming 21teams) from industry and academia submitted 499 valid system outputs. The evaluation results indicate that speaker diarization has improved markedly since DIHARD I, particularly for two-party interactions, but that for many domains (e.g., web video) the problem remains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗pyannote/speaker-diarization-3.1model· 11.3M dl· ♡ 170911.3M dl♡ 1709
- 🤗pyannote/speaker-diarization-community-1model· 2.0M dl· ♡ 2672.0M dl♡ 267
- 🤗pyannote/speaker-diarizationmodel· 741k dl· ♡ 1249741k dl♡ 1249
- 🤗pyannote/speaker-diarization-3.0model· 303k dl· ♡ 214303k dl♡ 214
- 🤗philschmid/pyannote-speaker-diarization-endpointmodel· 36 dl· ♡ 2036 dl♡ 20
- 🤗anilbs/pipelinemodel· 9 dl· ♡ 39 dl♡ 3
- 🤗tawkit/phil-pyannote-speaker-diarization-endpointmodel· 18 dl· ♡ 718 dl♡ 7
- 🤗bhuvanesh25/pyannote-diar-copymodel· 2 dl2 dl
- 🤗johnislarry/cloned-pyannote-speaker-diarization-endpointmodel· 8 dl8 dl
- 🤗paris-iea/speaker-diarizationmodel· 9 dl· ♡ 19 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
