Speech-dependent Modeling of Own Voice Transfer Characteristics for   In-ear Microphones in Hearables

Mattes Ohlenbusch; Christian Rollwage; Simon Doclo

arXiv:2309.08294·eess.AS·March 25, 2024

Speech-dependent Modeling of Own Voice Transfer Characteristics for In-ear Microphones in Hearables

Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

PDF

Open Access

TL;DR

This paper introduces a speech-dependent model for simulating in-ear microphone signals in hearables, improving the accuracy of voice transfer characteristic modeling by incorporating phoneme recognition, which enhances signal processing in noisy environments.

Contribution

It proposes a novel speech-dependent system identification approach based on phoneme recognition for modeling in-ear microphone transfer characteristics, outperforming speech-independent models.

Findings

01

Speech-dependent model yields more accurate in-ear voice simulations.

02

Model generalizes well across different talkers.

03

Speech-dependent approach outperforms speech-independent methods.

Abstract

Many hearables contain an in-ear microphone, which may be used to capture the own voice of its user in noisy environments. Since the in-ear microphone mostly records body-conducted speech due to ear canal occlusion, it suffers from band-limitation effects while only capturing a limited amount of external noise. To enhance the quality of the in-ear microphone signal using algorithms aiming at joint bandwidth extension, equalization, and noise reduction, it is desirable to have an accurate model of the own voice transfer characteristics between the entrance of the ear canal and the in-ear microphone. Such a model can be used, e.g., to simulate a large amount of in-ear recordings to train supervised learning-based algorithms. Since previous research on ear canal occlusion suggests that own voice transfer characteristics depend on speech content, in this contribution we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Adaptive Filtering Techniques