Joint Far- and Near-End Speech Intelligibility Enhancement based on the   Approximated Speech Intelligibility Index

Andreas Jonas Fuglsig; Jan {\O}stergaard; Jesper Jensen; Lars; S{\o}ndergaard Bertelsen; Peter Mariager; Zheng-Hua Tan

arXiv:2111.07759·eess.AS·May 6, 2022

Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index

Andreas Jonas Fuglsig, Jan {\O}stergaard, Jesper Jensen, Lars, S{\o}ndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan

PDF

TL;DR

This paper introduces a simplified approach to joint far- and near-end speech enhancement using the Approximated Speech Intelligibility Index, achieving effective results without complex modeling of speech variations.

Contribution

It proposes a closed-form solution for joint speech enhancement based on ASII, avoiding complex models and assumptions of natural speech variations.

Findings

01

Achieves similar performance to existing methods

02

Does not require modeling natural speech variations

03

Provides a closed-form, distribution-independent solution

Abstract

This paper considers speech enhancement of signals picked up in one noisy environment which must be presented to a listener in another noisy environment. Recently, it has been shown that an optimal solution to this problem requires the consideration of the noise sources in both environments jointly. However, the existing optimal mutual information based method requires a complicated system model that includes natural speech variations, and relies on approximations and assumptions of the underlying signal distributions. In this paper, we propose to use a simpler signal model and optimize speech intelligibility based on the Approximated Speech Intelligibility Index (ASII). We derive a closed-form solution to the joint far- and near-end speech enhancement problem that is independent of the marginal distribution of signal coefficients, and that achieves similar performance to existing work.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.