Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms
Joseph Konan, Shikhar Agnihotri, Ojas Bhargave, Shuo Han, Yunyang, Zeng, Ankit Shah, Bhiksha Raj

TL;DR
This paper investigates the psychoacoustic effects of proprietary denoising algorithms on VoIP speech quality, using innovative econometric analysis and psychoacoustic metrics across popular platforms like Google Meets and Zoom.
Contribution
It introduces a novel application of Blinder-Oaxaca decomposition to analyze acoustic perturbations in VoIP, and provides comprehensive benchmarking of speech enhancement models in this context.
Findings
Proprietary denoising impacts speech quality and intelligibility.
Psychoacoustic metrics reveal perceptual distortions caused by VoIP processing.
Benchmarking shows variability in enhancement model performance.
Abstract
Within the ambit of VoIP (Voice over Internet Protocol) telecommunications, the complexities introduced by acoustic transformations merit rigorous analysis. This research, rooted in the exploration of proprietary sender-side denoising effects, meticulously evaluates platforms such as Google Meets and Zoom. The study draws upon the Deep Noise Suppression (DNS) 2020 dataset, ensuring a structured examination tailored to various denoising settings and receiver interfaces. A methodological novelty is introduced via Blinder-Oaxaca decomposition, traditionally an econometric tool, repurposed herein to analyze acoustic-phonetic perturbations within VoIP systems. To further ground the implications of these transformations, psychoacoustic metrics, specifically PESQ and STOI, were used to explain of perceptual quality and intelligibility. Cumulatively, the insights garnered underscore the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Acoustic Wave Phenomena Research
