Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression
Zheng Chen, Mingde Zhou, Jinpei Guo, Jiale Yuan, Yifei Ji, Yulun Zhang

TL;DR
SODEC is a single-step diffusion-based image compression model that significantly reduces decoding latency and improves fidelity by leveraging rich latent representations and fidelity guidance, outperforming existing methods.
Contribution
The paper introduces SODEC, a novel single-step diffusion model for image compression that combines a pre-trained VAE, fidelity guidance, and rate annealing for superior performance.
Findings
Decoding speed improved by over 20 times.
Achieves better rate-distortion-perception performance.
Outperforms existing diffusion-based compression methods.
Abstract
Diffusion-based image compression has demonstrated impressive perceptual performance. However, it suffers from two critical drawbacks: (1) excessive decoding latency due to multi-step sampling, and (2) poor fidelity resulting from over-reliance on generative priors. To address these issues, we propose SODEC, a novel single-step diffusion image compression model. We argue that in image compression, a sufficiently informative latent renders multi-step refinement unnecessary. Based on this insight, we leverage a pre-trained VAE-based model to produce latents with rich information, and replace the iterative denoising process with a single-step decoding. Meanwhile, to improve fidelity, we introduce the fidelity guidance module, encouraging output that is faithful to the original image. Furthermore, we design the rate annealing training strategy to enable effective training under extremely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Data Compression Techniques · Image and Video Quality Assessment · Advanced Image Processing Techniques
