I4U System Description for NIST SRE'20 CTS Challenge

Kong Aik Lee; Tomi Kinnunen; Daniele Colibro; Claudio Vair; Andreas; Nautsch; Hanwu Sun; Liang He; Tianyu Liang; Qiongqiong Wang; Mickael Rouvier,; Pierre-Michel Bousquet; Rohan Kumar Das; Ignacio Vi\~nals Bailo; Meng Liu,; H\'ector Deldago; Xuechen Liu; Md Sahidullah; Sandro Cumani; Boning Zhang,; Koji Okabe; Hitoshi Yamamoto; Ruijie Tao; Haizhou Li; Alfonso Ortega; Gim\'enez; Longbiao Wang; Luis Buera

arXiv:2211.01091·eess.AS·November 3, 2022

I4U System Description for NIST SRE'20 CTS Challenge

Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas, Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier,, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Vi\~nals Bailo, Meng Liu,, H\'ector Deldago, Xuechen Liu, Md Sahidullah

PDF

Open Access

TL;DR

This paper details the I4U system submission to the NIST SRE'20 CTS Challenge, highlighting collaborative efforts, system fusion, and standardized evaluation procedures to improve speaker recognition performance.

Contribution

It introduces a multi-team collaborative system fusion approach with standardized evaluation protocols for the NIST SRE'20 CTS Challenge.

Findings

01

Fusion of top-performing sub-systems improved recognition accuracy.

02

Standardized development and validation sets facilitated consistent evaluation.

03

Collaborative approach enhanced overall system robustness.

Abstract

This manuscript describes the I4U submission to the 2020 NIST Speaker Recognition Evaluation (SRE'20) Conversational Telephone Speech (CTS) Challenge. The I4U's submission was resulted from active collaboration among researchers across eight research teams - I $^{2}$ R (Singapore), UEF (Finland), VALPT (Italy, Spain), NEC (Japan), THUEE (China), LIA (France), NUS (Singapore), INRIA (France) and TJU (China). The submission was based on the fusion of top performing sub-systems and sub-fusion systems contributed by individual teams. Efforts have been spent on the use of common development and validation sets, submission schedule and milestone, minimizing inconsistency in trial list and score file format across sites.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing