CaDM: Codec-aware Diffusion Modeling for Neural-enhanced Video Streaming
Qihua Zhou, Ruibin Li, Song Guo, Peiran Dong, Yi Liu, Jingcai Guo,, Zhenda Xu

TL;DR
CaDM introduces a codec-aware diffusion model that co-designs encoder and decoder for neural-enhanced video streaming, significantly reducing bitrate while improving restoration quality by leveraging diffusion models' generative capacity.
Contribution
This work pioneers the integration of encoder-decoder synergy using diffusion models to optimize neural-enhanced video streaming, addressing prior limitations in rate-distortion trade-offs.
Findings
Achieves up to 21.44x bitrate savings compared to existing methods.
Provides superior video restoration quality with a FID of 0.61.
Demonstrates effective performance on public cloud benchmarks.
Abstract
Recent years have witnessed the dramatic growth of Internet video traffic, where the video bitstreams are often compressed and delivered in low quality to fit the streamer's uplink bandwidth. To alleviate the quality degradation, it comes the rise of Neural-enhanced Video Streaming (NVS), which shows great prospects for recovering low-quality videos by mostly deploying neural super-resolution (SR) on the media server. Despite its benefit, we reveal that current mainstream works with SR enhancement have not achieved the desired rate-distortion trade-off between bitrate saving and quality restoration, due to: (1) overemphasizing the enhancement on the decoder side while omitting the co-design of encoder, (2) limited generative capacity to recover high-fidelity perceptual details, and (3) optimizing the compression-and-restoration pipeline from the resolution perspective solely, without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Advanced Vision and Imaging
MethodsDiffusion · Attentive Walk-Aggregating Graph Neural Network
