Extreme Video Compression with Pre-trained Diffusion Models

Bohan Li; Yiming Liu; Xueyan Niu; Bo Bai; Lei Deng; and Deniz; G\"und\"uz

arXiv:2402.08934·eess.IV·February 15, 2024·1 cites

Extreme Video Compression with Pre-trained Diffusion Models

Bohan Li, Yiming Liu, Xueyan Niu, Bo Bai, Lei Deng, and Deniz, G\"und\"uz

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel extreme video compression method using pre-trained diffusion models at the decoder, achieving high perceptual quality at very low bit rates by sequentially predicting frames and restarting when quality drops.

Contribution

The paper presents a new approach leveraging diffusion-based generative models for low-bpp video compression, demonstrating effective frame prediction and quality maintenance.

Findings

01

Achieves perceptually high-quality video reconstruction at 0.02 bpp.

02

Outperforms standard codecs like H.264 and H.265 in low bit rate regimes.

03

Uses temporal relations in video data with generative models.

Abstract

Diffusion models have achieved remarkable success in generating high quality image and video data. More recently, they have also been used for image compression with high perceptual quality. In this paper, we present a novel approach to extreme video compression leveraging the predictive power of diffusion-based generative models at the decoder. The conditional diffusion model takes several neural compressed frames and generates subsequent frames. When the reconstruction quality drops below the desired level, new frames are encoded to restart prediction. The entire video is sequentially encoded to achieve a visually pleasing reconstruction, considering perceptual quality metrics such as the learned perceptual image patch similarity (LPIPS) and the Frechet video distance (FVD), at bit rates as low as 0.02 bits per pixel (bpp). Experimental results demonstrate the effectiveness of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

elesionkyrie/extreme-video-compression-with-prediction-using-pre-trainded-diffusion-models-
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Image and Signal Denoising Methods

MethodsDiffusion