Conditional Video Generation for High-Efficiency Video Compression
Fangqiu Yi, Jingyu Xu, Jiawei Shao, Chi Zhang, Xuelong Li

TL;DR
This paper introduces a novel video compression framework using conditional diffusion models that optimize perceptual quality by synthesizing videos from sparse signals, outperforming traditional codecs especially at high compression ratios.
Contribution
It proposes a new approach framing video compression as a conditional generation task with multi-granular conditioning and robust training strategies.
Findings
Outperforms traditional and neural codecs on perceptual metrics
Achieves higher perceptual quality at high compression ratios
Demonstrates robustness through multi-condition training
Abstract
Perceptual studies demonstrate that conditional diffusion models excel at reconstructing video content aligned with human visual perception. Building on this insight, we propose a video compression framework that leverages conditional diffusion models for perceptually optimized reconstruction. Specifically, we reframe video compression as a conditional generation task, where a generative model synthesizes video from sparse, yet informative signals. Our approach introduces three key modules: (1) Multi-granular conditioning that captures both static scene structure and dynamic spatio-temporal cues; (2) Compact representations designed for efficient transmission without sacrificing semantic richness; (3) Multi-condition training with modality dropout and role-aware embeddings, which prevent over-reliance on any single modality and enhance robustness. Extensive experiments show that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Generative Adversarial Networks and Image Synthesis · Advanced Data Compression Techniques
