I4U System Description for NIST SRE'20 CTS Challenge
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas, Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier,, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Vi\~nals Bailo, Meng Liu,, H\'ector Deldago, Xuechen Liu, Md Sahidullah

TL;DR
This paper details the I4U system submission to the NIST SRE'20 CTS Challenge, highlighting collaborative efforts, system fusion, and standardized evaluation procedures to improve speaker recognition performance.
Contribution
It introduces a multi-team collaborative system fusion approach with standardized evaluation protocols for the NIST SRE'20 CTS Challenge.
Findings
Fusion of top-performing sub-systems improved recognition accuracy.
Standardized development and validation sets facilitated consistent evaluation.
Collaborative approach enhanced overall system robustness.
Abstract
This manuscript describes the I4U submission to the 2020 NIST Speaker Recognition Evaluation (SRE'20) Conversational Telephone Speech (CTS) Challenge. The I4U's submission was resulted from active collaboration among researchers across eight research teams - IR (Singapore), UEF (Finland), VALPT (Italy, Spain), NEC (Japan), THUEE (China), LIA (France), NUS (Singapore), INRIA (France) and TJU (China). The submission was based on the fusion of top performing sub-systems and sub-fusion systems contributed by individual teams. Efforts have been spent on the use of common development and validation sets, submission schedule and milestone, minimizing inconsistency in trial list and score file format across sites.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing
