Generative Preprocessing for Image Compression with Pre-trained Diffusion Models
Mengxi Guo, Shijie Zhao, Junlin Li, Li Zhang

TL;DR
This paper introduces a novel image preprocessing method using pre-trained diffusion models to optimize compression based on perceptual quality, achieving significant rate-perception improvements without altering standard codecs.
Contribution
It pioneers the adaptation of large-scale pre-trained diffusion models for compression preprocessing, combining distillation and fine-tuning for enhanced perceptual compression.
Findings
Up to 30.13% BD-rate reduction in DISTS on Kodak dataset
Improved subjective visual quality
Seamless integration with standard codecs
Abstract
Preprocessing is a well-established technique for optimizing compression, yet existing methods are predominantly Rate-Distortion (R-D) optimized and constrained by pixel-level fidelity. This work pioneers a shift towards Rate-Perception (R-P) optimization by, for the first time, adapting a large-scale pre-trained diffusion model for compression preprocessing. We propose a two-stage framework: first, we distill the multi-step Stable Diffusion 2.1 into a compact, one-step image-to-image model using Consistent Score Identity Distillation (CiD). Second, we perform a parameter-efficient fine-tuning of the distilled model's attention modules, guided by a Rate-Perception loss and a differentiable codec surrogate. Our method seamlessly integrates with standard codecs without any modification and leverages the model's powerful generative priors to enhance texture and mitigate artifacts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
