Enhancement Of Coded Speech Using a Mask-Based Post-Filter

Srikanth Korse; Kishan Gupta; Guillaume Fuchs

arXiv:2010.05571·eess.AS·October 13, 2020

Enhancement Of Coded Speech Using a Mask-Based Post-Filter

Srikanth Korse, Kishan Gupta, Guillaume Fuchs

PDF

TL;DR

This paper introduces a neural network-based post-filter that enhances low-bitrate coded speech by estimating time-frequency masks, outperforming traditional heuristic methods in both objective and subjective evaluations.

Contribution

It proposes a data-driven, mask-based post-filter using neural networks for speech enhancement at low bitrates, demonstrating superior performance over conventional filters.

Findings

01

Neural network post-filters improve speech quality at low bitrates.

02

Mask-based neural models outperform heuristic post-filters.

03

Objective and subjective tests confirm the effectiveness of the proposed method.

Abstract

The quality of speech codecs deteriorates at low bitrates due to high quantization noise. A post-filter is generally employed to enhance the quality of the coded speech. In this paper, a data-driven post-filter relying on masking in the time-frequency domain is proposed. A fully connected neural network (FCNN), a convolutional encoder-decoder (CED) network and a long short-term memory (LSTM) network are implemeted to estimate a real-valued mask per time-frequency bin. The proposed models were tested on the five lowest operating modes (6.65 kbps-15.85 kbps) of the Adaptive Multi-Rate Wideband codec (AMR-WB). Both objective and subjective evaluations confirm the enhancement of the coded speech and also show the superiority of the mask-based neural network system over a conventional heuristic post-filter used in the standard like ITU-T G.718.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.