Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones
Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

TL;DR
This paper introduces a speech-dependent model for own voice transfer in hearables with in-ear microphones, improving simulation accuracy over speech-independent models and demonstrating better generalization across talkers.
Contribution
It proposes a novel phoneme-based linear time-invariant transfer function model for own voice transfer characteristics in hearables, accounting for speech content and individual differences.
Findings
Speech-dependent models outperform speech-independent models in simulation accuracy.
Talker-averaged models generalize better to new talkers.
Experimental validation with prototype hearables confirms improved modeling performance.
Abstract
Many hearables contain an in-ear microphone, which may be used to capture the own voice of its user. However, due to the hearable occluding the ear canal, the in-ear microphone mostly records body-conducted speech, typically suffering from band-limitation effects and amplification at low frequencies. Since the occlusion effect is determined by the ratio between the air-conducted and body-conducted components of own voice, the own voice transfer characteristics between the outer face of the hearable and the in-ear microphone depend on the speech content and the individual talker. In this paper, we propose a speech-dependent model of the own voice transfer characteristics based on phoneme recognition, assuming a linear time-invariant relative transfer function for each phoneme. We consider both individual models as well as models averaged over several talkers. Experimental results based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing
