Pay Attention to Hard Trials

Lantian Li; Di Wang; Dong Wang

arXiv:2209.04687·cs.SD·September 13, 2022

Pay Attention to Hard Trials

Lantian Li, Di Wang, Dong Wang

PDF

Open Access 1 Repo

TL;DR

This paper highlights the importance of focusing on hard trials in speaker recognition evaluation, proposing an SVM-based method to identify such trials and create more challenging benchmarks for fairer system comparison.

Contribution

It introduces a novel SVM-based approach to identify hard trials and constructs new evaluation sets that improve the assessment of speaker recognition systems.

Findings

01

New evaluation sets with hard trials improve system benchmarking.

02

Re-evaluation of recent technologies shows different performance insights.

03

Code and datasets will be publicly available.

Abstract

Performance of speaker recognition systems is evaluated on test trials. Although as crucial as rulers for tailors, trials have not been carefully treated so far, and most existing benchmarks compose trials by naive cross-pairing. In this paper, we argue that the cross-pairing approach produces overwhelming easy trials, which in turn leads to potential bias in system and technique comparison. To solve the problem, we advocate more attention to hard trials. We present an SVM-based approach to identifying hard trials and use it to construct new evaluation sets for VoxCeleb1 and SITW. With the new sets, we can re-evaluate the contribution of some recent technologies. The code and the identified hard trials will be published online at http://project.cslt.org.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/csltstu/sunine
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Speech and dialogue systems