Modeling of Speech-dependent Own Voice Transfer Characteristics for   Hearables with In-ear Microphones

Mattes Ohlenbusch; Christian Rollwage; Simon Doclo

arXiv:2310.06554·eess.AS·September 9, 2024·1 cites

Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones

Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

PDF

Open Access

TL;DR

This paper introduces a speech-dependent model for own voice transfer in hearables with in-ear microphones, improving simulation accuracy over speech-independent models and demonstrating better generalization across talkers.

Contribution

It proposes a novel phoneme-based linear time-invariant transfer function model for own voice transfer characteristics in hearables, accounting for speech content and individual differences.

Findings

01

Speech-dependent models outperform speech-independent models in simulation accuracy.

02

Talker-averaged models generalize better to new talkers.

03

Experimental validation with prototype hearables confirms improved modeling performance.

Abstract

Many hearables contain an in-ear microphone, which may be used to capture the own voice of its user. However, due to the hearable occluding the ear canal, the in-ear microphone mostly records body-conducted speech, typically suffering from band-limitation effects and amplification at low frequencies. Since the occlusion effect is determined by the ratio between the air-conducted and body-conducted components of own voice, the own voice transfer characteristics between the outer face of the hearable and the in-ear microphone depend on the speech content and the individual talker. In this paper, we propose a speech-dependent model of the own voice transfer characteristics based on phoneme recognition, assuming a linear time-invariant relative transfer function for each phoneme. We consider both individual models as well as models averaged over several talkers. Experimental results based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing