# A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet

**Authors:** Jean-Marc Valin, Jan Skoglund

arXiv: 1903.12087 · 2019-07-01

## TL;DR

This paper introduces a low-bitrate neural vocoder based on LPCNet that operates in real-time, achieving higher speech quality than traditional codecs at 1.6 kb/s by leveraging linear prediction and sparse networks.

## Contribution

The paper presents a novel neural vocoder using LPCNet that combines linear prediction and sparsity to enable real-time low-bitrate speech synthesis.

## Key findings

- LPCNet at 1.6 kb/s outperforms MELP in quality.
- Uncompressed LPCNet exceeds low-bitrate waveform codec quality.
- Real-time operation achieved on general-purpose hardware.

## Abstract

Neural speech synthesis algorithms are a promising new approach for coding speech at very low bitrate. They have so far demonstrated quality that far exceeds traditional vocoders, at the cost of very high complexity. In this work, we present a low-bitrate neural vocoder based on the LPCNet model. The use of linear prediction and sparse recurrent networks makes it possible to achieve real-time operation on general-purpose hardware. We demonstrate that LPCNet operating at 1.6 kb/s achieves significantly higher quality than MELP and that uncompressed LPCNet can exceed the quality of a waveform codec operating at low bitrate. This opens the way for new codec designs based on neural synthesis models.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.12087/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1903.12087/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1903.12087/full.md

---
Source: https://tomesphere.com/paper/1903.12087