# The Effect of Modulation Enhancement Scheme on Speech Recognition in Spatial Noise Among Young Adults with Normal Hearing

**Authors:** Vibha Kanagokar, Yashu MA, Jayashree S. Bhat, Arivudai Nambi Pitchaimuthu

PMC · DOI: 10.3390/audiolres16010026 · Audiology Research · 2026-02-14

## TL;DR

This study shows that enhancing the temporal envelope of speech improves speech recognition in noisy environments for young adults with normal hearing.

## Contribution

The study demonstrates that envelope enhancement improves spatial unmasking and binaural timing cues in normal-hearing listeners.

## Key findings

- ENV enhancement significantly increased spatial release from masking across all SNRs and spatial separations.
- Benefits of ENV enhancement were greatest at lower SNRs and wider target-masker separations.
- Enhanced interaural coherence and more reliable ITD estimates were observed under the expanded condition.

## Abstract

Background/Objectives: Speech understanding in noise relies on both temporal fine structure (TFS) and temporal envelope (ENV) cues. While TFS primarily conveys interaural time differences (ITDs) at low frequencies, ENV cues can also support ITD processing, especially when TFS is unavailable or degraded. Expanding the ENV by increasing modulation depth has been proposed to improve speech perception, but its effects on spatial release from masking (SRM) and binaural temporal processing in normal-hearing listeners remain unclear. The goal of this study was to evaluate the effect of ENV enhancement on SRM in young adults with normal hearing and its influence on ITD sensitivity and interaural coherence (IC). Method: Thirty normal-hearing native Kannada speakers (19–34 years) participated. Speech stimuli consisted of Kannada sentences embedded in four-talker babble at −5, 0, and +5 dB signal to noise ratio (SNR). Target and masker were spatialized using head-related transfer functions at 0°, 15°, and 37.5° azimuths. Stimuli were presented with and without ENV enhancement (compression–expansion algorithm). Speech recognition scores were analyzed using generalized linear mixed models, and SRM was calculated as performance differences between co-located and spatially separated conditions. Cross-correlation analyses were performed to estimate ITDs and IC across SNRs. Result: ENV enhancement yielded significantly higher SRM values across all SNRs and spatial separations. Benefits were greatest at lower SNRs and wider target–masker separations. Cross-correlation analysis showed enhanced IC and more reliable ITD estimates under the expanded condition, particularly at moderate SNRs. Conclusions: Temporal ENV enhancement strengthens spatial unmasking and binaural timing cues in normal-hearing adults, especially under adverse listening conditions. These findings highlight its potential application in auditory rehabilitation and hearing technologies where ENV cues are critical.

## Full-text entities

- **Diseases:** SRM (MESH:D059468), outer (MESH:C538223), cell dysfunction (MESH:D002292), injury to (MESH:D014947), loss of compression (MESH:D009408), sensorineural hearing loss (MESH:D006319), hearing impairment (MESH:D034381), auditory neuropathy (MESH:C538268), fatigue (MESH:D005221)
- **Chemicals:** DHARMa (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12921754/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12921754/full.md

## References

64 references — full list in the complete paper: https://tomesphere.com/paper/PMC12921754/full.md

---
Source: https://tomesphere.com/paper/PMC12921754