Towards a generalized monaural and binaural auditory model for psychoacoustics and speech intelligibility
Thomas Biberger, Stephan D. Ewert

TL;DR
This paper introduces a unified auditory model that combines monaural and binaural cues through a simplified, physiologically inspired binaural processing stage, aimed at predicting both psychoacoustic and speech intelligibility data.
Contribution
It extends a monaural envelope power spectrum model with a fixed, physiologically motivated binaural stage, creating a unified framework for monaural and binaural auditory modeling.
Findings
The model integrates monaural and binaural cues within a single framework.
It reproduces results of psychoacoustic experiments from the literature.
The binaural stage is fixed and non-adaptive, which simplifies binaural processing while remaining physiologically motivated.
Abstract
Auditory perception involves cues in the monaural auditory pathways as well as binaural cues based on differences between the ears. So far, auditory models have often focused on either monaural or binaural experiments in isolation. Although binaural models typically build upon stages of (existing) monaural models, only a few attempts have been made to extend a monaural model by a binaural stage using a unified decision stage for monaural and binaural cues. In such approaches, a typical prototype of binaural processing has been the classical equalization-cancelation mechanism, which either involves signal-adaptive delays and provides a single-channel output or can be implemented with tapped delays providing a high-dimensional multichannel output. This contribution extends the (monaural) generalized envelope power spectrum model by a non-adaptive binaural stage with only a few, fixed…
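For readers unfamiliar with the classical equalization-cancelation idea mentioned in the abstract, the following is a minimal sketch of its signal-adaptive, single-channel form. It is not the non-adaptive stage proposed in the paper. The function name `ec_residual`, the integer-sample delay search, the circular shift, and the least-squares gain are all assumptions chosen for illustration.

```python
import numpy as np

def ec_residual(left, right, max_lag=32):
    """Illustrative equalization-cancelation (hypothetical helper).

    Equalization: delay and scale the right-ear signal so that it matches
    the left-ear signal as closely as possible.
    Cancelation: subtract the equalized signal and measure what remains.
    Returns (residual_power, best_lag, best_gain).
    """
    best = (np.inf, 0, 1.0)
    for lag in range(-max_lag, max_lag + 1):
        # Circular shift is an assumption made to keep the sketch short.
        shifted = np.roll(right, lag)
        denom = np.dot(shifted, shifted)
        # Least-squares gain that best maps the shifted signal onto `left`.
        gain = np.dot(left, shifted) / denom if denom > 0 else 0.0
        residual = left - gain * shifted
        power = np.mean(residual ** 2)
        if power < best[0]:
            best = (power, lag, gain)
    return best

# Usage: a noise that reaches the right ear 5 samples earlier and at half
# the amplitude is cancelled almost completely after equalization.
rng = np.random.default_rng(0)
noise = rng.standard_normal(2048)
power, lag, gain = ec_residual(noise, 0.5 * np.roll(noise, -5))
```

The delay search is what makes this variant signal-adaptive: the lag and gain are chosen per input. A tapped-delay implementation would instead keep the residual for every lag, giving the multichannel output the abstract refers to.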
