Scalable and Efficient Neural Speech Coding: A Hybrid Design
Kai Zhen, Jongmo Sung, Mi Suk Lee, Seungkwon Beak, Minje Kim

TL;DR
This paper introduces a scalable, efficient neural speech coding system combining neural waveform codecs with residual learning and hybrid LPC integration, achieving competitive compression quality at various bitrates.
Contribution
The work presents a novel hybrid neural speech codec with scalable architecture and trainable quantization, improving efficiency and flexibility over prior neural speech coding methods.
Findings
Achieves comparable or superior quality to AMR-WB and Opus at low and medium bitrates.
Reduces decoding complexity compared to other neural speech coders.
Supports scalable bitrates through residual learning and hybrid LPC integration.
Abstract
We present a scalable and efficient neural waveform coding system for speech compression. We formulate the speech coding problem as an autoencoding task, where a convolutional neural network (CNN) performs encoding and decoding as a neural waveform codec (NWC) during its feedforward routine. The proposed NWC also defines quantization and entropy coding as a trainable module, so the coding artifacts and bitrate control are handled during the optimization process. We achieve efficiency by introducing compact model components to NWC, such as gated residual networks and depthwise separable convolution. Furthermore, the proposed models are with a scalable architecture, cross-module residual learning (CMRL), to cover a wide range of bitrates. To this end, we employ the residual coding concept to concatenate multiple NWC autoencoding modules, where each NWC module performs residual coding to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDilated Causal Convolution · Mixture of Logistic Distributions · Solana Customer Service Number +1-833-534-1729 · WaveNet
