ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems
Daniel Hao Xian Yuen, Andrew Yong Chen Pang, Zhou Yang, Chun Yong, Chong, Mei Kuan Lim, David Lo

TL;DR
ASDF is a novel differential testing framework for ASR systems that generates diverse test cases using text transformations and phonetic analysis, helping developers identify and improve error-prone phonemes.
Contribution
This paper introduces ASDF, which enhances existing testing tools with text transformation and phonetic analysis to generate more test cases and provide actionable insights.
Findings
ASDF generates more high-quality test cases than previous tools.
It identifies phonemes where ASR systems tend to make errors.
ASDF offers comprehensive metrics for ASR performance evaluation.
Abstract
Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes ASDF, an Automated Speech Recognition Differential Testing Framework for testing ASR systems. ASDF extends an existing ASR testing tool, the CrossASR++, which synthesizes test cases from a text corpus. However, CrossASR++ fails to make use of the text corpus efficiently and provides limited information on how the failed test cases can improve ASR systems. To address these limitations, our tool incorporates two novel features: (1) a text transformation module to boost the number of generated test cases and uncover more errors in ASR systems and (2) a phonetic analysis module to identify on which phonemes the ASR system tend to produce errors. ASDF generates more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques
MethodsTest
