(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test

Stefan Bleeck

arXiv:2506.11620·cs.SD·June 16, 2025

(SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test

Stefan Bleeck

PDF

Open Access

TL;DR

The paper presents SimPhon Speech Test, a novel in silico method using ASR to design and validate a phonetically balanced speech test that captures perceptual deficits beyond traditional measures.

Contribution

It introduces a data-driven, multi-stage pipeline leveraging ASR for creating and validating a minimal-pair speech test, advancing audiological diagnostics.

Findings

01

The test items' performance does not correlate with the Speech Intelligibility Index.

02

The methodology efficiently reduces candidate items to an optimized set of 25 pairs.

03

The test captures perceptual deficits beyond simple audibility measures.

Abstract

Traditional audiometry often provides an incomplete characterization of the functional impact of hearing loss on speech understanding, particularly for supra-threshold deficits common in presbycusis. This motivates the development of more diagnostically specific speech perception tests. We introduce the Simulated Phoneme Speech Test (SimPhon Speech Test) methodology, a novel, multi-stage computational pipeline for the in silico design and validation of a phonetically balanced minimal-pair speech test. This methodology leverages a modern Automatic Speech Recognition (ASR) system as a proxy for a human listener to simulate the perceptual effects of sensorineural hearing loss. By processing speech stimuli under controlled acoustic degradation, we first identify the most common phoneme confusion patterns. These patterns then guide the data-driven curation of a large set of candidate word…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsSparse Evolutionary Training