(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test
Stefan Bleeck

TL;DR
The paper presents SimPhon Speech Test, a novel in silico method using ASR to design and validate a phonetically balanced speech test that captures perceptual deficits beyond traditional measures.
Contribution
It introduces a data-driven, multi-stage pipeline leveraging ASR for creating and validating a minimal-pair speech test, advancing audiological diagnostics.
Findings
The test items' performance does not correlate with the Speech Intelligibility Index.
The methodology efficiently reduces candidate items to an optimized set of 25 pairs.
The test captures perceptual deficits beyond simple audibility measures.
Abstract
Traditional audiometry often provides an incomplete characterization of the functional impact of hearing loss on speech understanding, particularly for supra-threshold deficits common in presbycusis. This motivates the development of more diagnostically specific speech perception tests. We introduce the Simulated Phoneme Speech Test (SimPhon Speech Test) methodology, a novel, multi-stage computational pipeline for the in silico design and validation of a phonetically balanced minimal-pair speech test. This methodology leverages a modern Automatic Speech Recognition (ASR) system as a proxy for a human listener to simulate the perceptual effects of sensorineural hearing loss. By processing speech stimuli under controlled acoustic degradation, we first identify the most common phoneme confusion patterns. These patterns then guide the data-driven curation of a large set of candidate word…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
MethodsSparse Evolutionary Training
