Extrapolating false alarm rates in automatic speaker verification
Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee

TL;DR
This paper introduces generative models for extrapolating false alarm rates in automatic speaker verification, enabling performance prediction on large populations without additional data collection.
Contribution
It proposes a worst-case, generative modeling approach in the detection score space for reliable false alarm rate extrapolation in ASV systems.
Findings
Models enable sampling of new speakers for performance analysis.
Approach facilitates extrapolation without collecting new data.
Applicable to various ASV systems.
Abstract
Automatic speaker verification (ASV) vendors and corpus providers would both benefit from tools to reliably extrapolate performance metrics for large speaker populations without collecting new speakers. We address false alarm rate extrapolation under a worst-case model whereby an adversary identifies the closest impostor for a given target speaker from a large population. Our models are generative and allow sampling new speakers. The models are formulated in the ASV detection score space to facilitate analysis of arbitrary ASV systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
