Adaptive hybrid speech coding with a MLP LPC structure

Marcos Faundez-Zanuy

arXiv:2204.00245·cs.SD·April 4, 2022

Adaptive hybrid speech coding with a MLP LPC structure

Marcos Faundez-Zanuy

PDF

TL;DR

This paper introduces an adaptive hybrid speech coding scheme combining linear predictive coding and neural networks, achieving improved speech quality with lower bitrates.

Contribution

It presents a novel combined linear/nonlinear predictive model for speech coding that enhances performance over traditional methods.

Findings

01

Improved SEGSNR by 1 to 2.5 dB with adaptive quantization

02

Effective integration of LPC and neural nets in speech coding

03

Demonstrated benefits of nonlinear models in real applications

Abstract

In the last years there has been a growing interest for nonlinear speech models. Several works have been published revealing the better performance of nonlinear techniques, but little attention has been dedicated to the implementation of the nonlinear model into real applications. This work is focused on the study of the behaviour of a combined linear/nonlinear predictive model based on linear predictive coding (LPC-10) and neural nets, in a speech waveform coder. Our novel scheme obtains an improvement in SEGSNR between 1 and 2.5 dB for an adaptive quantization ranging from 2 to 5 bits.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.