PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

Xiaofeng Ge; Jiangyu Han; Yanhua Long; Haixin Guan

arXiv:2203.02263 · eess.AS · March 7, 2022 · 1 cites

Open Access

TL;DR

PercepNet+ enhances real-time speech quality by integrating phase awareness, SNR estimation, and advanced neural modeling, significantly outperforming the original PercepNet in speech quality metrics.

Contribution

This work introduces PercepNet+ with phase-aware features, SNR estimation, TF-GRU, and multi-objective loss, advancing real-time speech enhancement techniques.

Findings

01. PercepNet+ significantly improves PESQ and STOI scores.

02. The phase-aware structure enhances speech quality.

03. The model maintains efficiency with minimal size increase.

Abstract

PercepNet, a recent extension of RNNoise, is an efficient, high-quality, real-time full-band speech enhancement technique that has shown promising performance in various public deep noise suppression tasks. This paper proposes a new approach, named PercepNet+, that further extends PercepNet with four significant improvements. First, we introduce a phase-aware structure that brings phase information into PercepNet by adding complex features and complex subband gains as the deep network input and output, respectively. Then, a signal-to-noise ratio (SNR) estimator and an SNR-switched post-processing step are specially designed to alleviate the over-attenuation (OA) that appears in high-SNR conditions with the original PercepNet. Moreover, the GRU layer is replaced by TF-GRU to model both temporal and frequency dependencies. Finally, we propose to integrate the loss of complex subband…
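To make the phase-aware idea in the abstract concrete, the following is a minimal sketch of the general difference between a real-valued and a complex-valued gain applied to the STFT bins of one band. It is not the authors' implementation: the function names, the two example bins, and the gain values are all hypothetical, and the paper's actual band layout and gain estimation are not described in this excerpt.

```python
import cmath

def apply_real_gain(noisy_bins, gain):
    """Scale each complex bin by a real gain: magnitude changes, phase does not."""
    return [gain * x for x in noisy_bins]

def apply_complex_gain(noisy_bins, gain):
    """Multiply each complex bin by a complex gain: magnitude and phase both change."""
    return [gain * x for x in noisy_bins]

# Hypothetical noisy STFT bins belonging to a single band (illustrative values only).
noisy = [complex(1.0, 1.0), complex(0.5, -0.5)]

# A real gain of 0.5 versus a complex gain with the same magnitude
# plus a 45-degree phase rotation (both values are assumptions).
real_out = apply_real_gain(noisy, 0.5)
complex_out = apply_complex_gain(noisy, cmath.rect(0.5, cmath.pi / 4))

# Phase change of the first bin under each kind of gain.
phase_shift_real = cmath.phase(real_out[0]) - cmath.phase(noisy[0])
phase_shift_complex = cmath.phase(complex_out[0]) - cmath.phase(noisy[0])

print("phase shift with real gain:   ", phase_shift_real)
print("phase shift with complex gain:", phase_shift_complex)
```

Both gains attenuate the bin by the same amount, but only the complex gain can also correct its phase, which is the motivation for predicting complex rather than purely real subband gains.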

Taxonomy

Topics: Speech and Audio Processing · Advanced Adaptive Filtering Techniques · Hearing Loss and Rehabilitation

Methods: Gated Recurrent Unit