Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids
Mathuranathan Mayuravaani, W. Bastiaan Kleijn, Andrew Lensen, and Charlotte S{\o}rensen

TL;DR
This paper introduces a simulation-based, single-microphone method for own voice detection in hearing aids, using transfer function simulations and a transformer classifier to achieve high accuracy and generalize to real-world data.
Contribution
It proposes a novel data augmentation strategy with simulated transfer functions and a hierarchical training approach for ML-based own voice detection in hearing aids.
Findings
95.52% accuracy on simulated head-and-torso data
90.02% accuracy with short utterances
80% accuracy on real recordings without fine-tuning
Abstract
This paper presents a simulation-based approach to own voice detection (OVD) in hearing aids using a single microphone. While OVD can significantly improve user comfort and speech intelligibility, existing solutions often rely on multiple microphones or additional sensors, increasing device complexity and cost. To enable ML-based OVD without requiring costly transfer-function measurements, we propose a data augmentation strategy based on simulated acoustic transfer functions (ATFs) that expose the model to a wide range of spatial propagation conditions. A transformer-based classifier is first trained on analytically generated ATFs and then progressively fine-tuned using numerically simulated ATFs, transitioning from a rigid-sphere model to a detailed head-and-torso representation. This hierarchical adaptation enabled the model to refine its spatial understanding while maintaining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Speech Recognition and Synthesis
