Adversarial Auto-Encoding for Packet Loss Concealment
Santiago Pascual, Joan Serr\`a, Jordi Pons

TL;DR
This paper introduces PLAAE, a non-autoregressive adversarial auto-encoder for real-time packet loss concealment in voice communication, outperforming existing models in quality and coherence.
Contribution
The paper presents a novel non-autoregressive adversarial auto-encoder architecture for waveform domain packet loss concealment, enabling real-time, smooth signal reconstruction.
Findings
PLAAE outperforms classic PLCs and autoregressive models in spectral and intonation reconstruction.
PLAAE achieves higher perceptual quality and intelligibility in reconstructed speech.
The model predicts coherent signal continuations in a single feed-forward step.
Abstract
Communication technologies like voice over IP operate under constrained real-time conditions, with voice packets being subject to delays and losses from the network. In such cases, the packet loss concealment (PLC) algorithm reconstructs missing frames until a new real packet is received. Recently, autoregressive deep neural networks have been shown to surpass the quality of signal processing methods for PLC, specially for long-term predictions beyond 60 ms. In this work, we propose a non-autoregressive adversarial auto-encoder, named PLAAE, to perform real-time PLC in the waveform domain. PLAAE has a causal convolutional structure, and it learns in an auto-encoder fashion to reconstruct signals with gaps, with the help of an adversarial loss. During inference, it is able to predict smooth and coherent continuations of such gaps in a single feed-forward step, as opposed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
