Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression

Zheng Chen; Mingde Zhou; Jinpei Guo; Jiale Yuan; Yifei Ji; Yulun Zhang

arXiv:2508.04979·cs.CV·December 3, 2025

Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression

Zheng Chen, Mingde Zhou, Jinpei Guo, Jiale Yuan, Yifei Ji, Yulun Zhang

PDF

Open Access 1 Video

TL;DR

SODEC is a single-step diffusion-based image compression model that significantly reduces decoding latency and improves fidelity by leveraging rich latent representations and fidelity guidance, outperforming existing methods.

Contribution

The paper introduces SODEC, a novel single-step diffusion model for image compression that combines a pre-trained VAE, fidelity guidance, and rate annealing for superior performance.

Findings

01

Decoding speed improved by over 20 times.

02

Achieves better rate-distortion-perception performance.

03

Outperforms existing diffusion-based compression methods.

Abstract

Diffusion-based image compression has demonstrated impressive perceptual performance. However, it suffers from two critical drawbacks: (1) excessive decoding latency due to multi-step sampling, and (2) poor fidelity resulting from over-reliance on generative priors. To address these issues, we propose SODEC, a novel single-step diffusion image compression model. We argue that in image compression, a sufficiently informative latent renders multi-step refinement unnecessary. Based on this insight, we leverage a pre-trained VAE-based model to produce latents with rich information, and replace the iterative denoising process with a single-step decoding. Meanwhile, to improve fidelity, we introduce the fidelity guidance module, encouraging output that is faithful to the original image. Furthermore, we design the rate annealing training strategy to enable effective training under extremely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression· underline

Taxonomy

TopicsAdvanced Data Compression Techniques · Image and Video Quality Assessment · Advanced Image Processing Techniques