Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse

Wenzhuo Ma; Zhenzhong Chen

arXiv:2501.13528 · cs.CV · January 24, 2025

Open Access

TL;DR

This paper introduces DiffVC, a diffusion-based neural video compression framework that leverages temporal information reuse and quantization parameter prompting to improve efficiency and quality across variable bitrates.

Contribution

The paper presents a diffusion-based video compression method with two additions: a temporal diffusion information reuse strategy that speeds up inference, and a quantization parameter prompting mechanism that adapts the model to variable bitrates.

Findings

01. Achieves high perceptual quality and visual fidelity in video compression.

02. Significantly improves inference efficiency with minimal performance loss.

03. Effective across various bitrates with robust quality.

Abstract

Recently, foundational diffusion models have attracted considerable attention in image compression tasks, whereas their application to video compression remains largely unexplored. In this article, we introduce DiffVC, a diffusion-based perceptual neural video compression framework that effectively integrates a foundational diffusion model with the video conditional coding paradigm. This framework uses temporal context from the previously decoded frame and the reconstructed latent representation of the current frame to guide the diffusion model in generating high-quality results. To accelerate the iterative inference process of the diffusion model, we propose the Temporal Diffusion Information Reuse (TDIR) strategy, which significantly enhances inference efficiency with minimal performance loss by reusing the diffusion information from previous frames. Additionally, to address the challenges…
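The reuse idea described in the abstract can be illustrated with a toy sketch. This is not the DiffVC implementation: the abstract does not say which diffusion information is reused or at which steps, so the function `full_features`, the scalar "frames", the update rule, and the `reuse_steps` parameter are all hypothetical stand-ins. The sketch only shows why caching per-step information from the previous frame reduces the number of expensive network evaluations.

```python
from typing import Dict, List, Set, Tuple


def full_features(x: float, step: int) -> float:
    """Hypothetical stand-in for the expensive part of one denoising step."""
    return 0.5 * x / (step + 1)


def denoise_sequence(
    frames: List[float], num_steps: int, reuse_steps: Set[int]
) -> Tuple[List[float], int]:
    """Denoise frames in order, reusing cached per-step information.

    For every frame after the first, steps listed in `reuse_steps` skip the
    expensive call and take the value cached from the previous frame at the
    same step. Returns the outputs and the number of expensive calls made.
    """
    cache: Dict[int, float] = {}  # step index -> information from previous frame
    outputs: List[float] = []
    full_calls = 0
    for index, x in enumerate(frames):
        new_cache: Dict[int, float] = {}
        for step in range(num_steps):
            if index > 0 and step in reuse_steps:
                info = cache[step]  # reuse instead of recomputing
            else:
                info = full_features(x, step)
                full_calls += 1
            new_cache[step] = info
            x = x - info  # toy update rule, an assumption for illustration
        cache = new_cache
        outputs.append(x)
    return outputs, full_calls


frames = [1.0, 1.1, 0.9]
_, baseline_calls = denoise_sequence(frames, num_steps=4, reuse_steps=set())
_, reuse_calls = denoise_sequence(frames, num_steps=4, reuse_steps={0, 1})
print(baseline_calls, reuse_calls)  # 12 8
```

In this toy setting, reusing two of the four steps on every frame after the first cuts the expensive calls from 12 to 8. The quality cost of such reuse depends on how similar consecutive frames are, which the sketch does not model.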

Taxonomy

Topics: Image and Signal Denoising Methods · Image and Video Stabilization · Advanced Data Compression Techniques

Methods: Softmax · Attention Is All You Need · Diffusion