AB/BA analysis: A framework for estimating keyword spotting recall   improvement while maintaining audio privacy

Raphael Petegrosso; Vasistakrishna Baderdinni; Thibaud Senechal,; Benjamin L. Bullough

arXiv:2204.08474·cs.SD·April 20, 2022

AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy

Raphael Petegrosso, Vasistakrishna Baderdinni, Thibaud Senechal,, Benjamin L. Bullough

PDF

Open Access

TL;DR

This paper introduces AB/BA analysis, a privacy-preserving evaluation framework for keyword spotting systems that estimates recall improvements without requiring negative samples or compromising privacy.

Contribution

The paper presents a novel AB/BA analysis framework for evaluating KWS models, including methods for estimating recall and false positive rates under privacy constraints.

Findings

01

AB/BA analysis accurately measures recall improvements.

02

The framework estimates false positive rate with low variance.

03

Semi-supervised extension enhances analysis efficiency and privacy.

Abstract

Evaluation of keyword spotting (KWS) systems that detect keywords in speech is a challenging task under realistic privacy constraints. The KWS is designed to only collect data when the keyword is present, limiting the availability of hard samples that may contain false negatives, and preventing direct estimation of model recall from production data. Alternatively, complementary data collected from other sources may not be fully representative of the real application. In this work, we propose an evaluation technique which we call AB/BA analysis. Our framework evaluates a candidate KWS model B against a baseline model A, using cross-dataset offline decoding for relative recall estimation, without requiring negative examples. Moreover, we propose a formulation with assumptions that allow estimation of relative false positive rate between models with low variance even when the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Speech and Audio Processing