Improving Speech Enhancement through Fine-Grained Speech Characteristics
Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar, Shinji Watanabe,, Bhiksha Raj

TL;DR
This paper introduces a novel method for speech enhancement that optimizes key perceptual speech features to improve naturalness and quality, applicable to existing deep learning systems, validated on the DNS dataset.
Contribution
It proposes a new perceptually motivated objective function based on differentiable estimators of acoustic features for fine-tuning speech enhancement models.
Findings
Improved speech naturalness and quality in enhanced signals.
Enhanced deep learning systems outperform state-of-the-art on DNS dataset.
Method is generic and applicable to various existing systems.
Abstract
While deep learning based speech enhancement systems have made rapid progress in improving the quality of speech signals, they can still produce outputs that contain artifacts and can sound unnatural. We propose a novel approach to speech enhancement aimed at improving perceptual quality and naturalness of enhanced signals by optimizing for key characteristics of speech. We first identify key acoustic parameters that have been found to correlate well with voice quality (e.g. jitter, shimmer, and spectral flux) and then propose objective functions which are aimed at reducing the difference between clean speech and enhanced speech with respect to these features. The full set of acoustic features is the extended Geneva Acoustic Parameter Set (eGeMAPS), which includes 25 different attributes associated with perception of speech. Given the non-differentiable nature of these feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Infant Health and Development
