ABSP System for The Third DIHARD Challenge
A Kishore Kumar, Shefali Waldekar, Goutam Saha, Md Sahidullah

TL;DR
This paper presents a speaker diarization system for the DIHARD III challenge that uses acoustic domain identification and domain-dependent clustering to improve diarization accuracy.
Contribution
The novel contribution is the development of an ADI system based on speaker embeddings and the integration of domain-dependent thresholds and PCA parameters for improved diarization.
Findings
Achieved a 9.63% relative DER reduction in core conditions.
Achieved a 10.64% relative DER reduction in full conditions.
Demonstrated effectiveness of domain-dependent processing in speaker diarization.
Abstract
This report describes the speaker diarization system developed by the ABSP Laboratory team for the third DIHARD speech diarization challenge. Our primary contribution is to develop acoustic domain identification (ADI) system for speaker diarization. We investigate speaker embeddings based ADI system. We apply a domain-dependent threshold for agglomerative hierarchical clustering. Besides, we optimize the parameters for PCA-based dimensionality reduction in a domain-dependent way. Our method of integrating domain-based processing schemes in the baseline system of the challenge achieved a relative improvement of and in DER for core and full conditions, respectively, for Track 1 of the DIHARD III evaluation set.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Algorithms and Data Compression · Speech and Audio Processing
