Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach
Sha Guo, Zhuo Chen, Yang Zhao, Ning Zhang, Xiaotong Li, Lingyu Duan

TL;DR
This paper presents a scalable, content-adaptive diffusion-based image compression method that improves perceptual quality and preserves features for machine vision tasks, outperforming existing codecs without retraining.
Contribution
Introduces a novel diffusion-based framework for scalable image compression that captures textures and semantics adaptively, enabling flexible compression control and better downstream task performance.
Findings
Enhanced perceptual quality over state-of-the-art methods
Effective preservation of textures and semantics for machine vision
Flexible control over compression ratios without retraining
Abstract
Traditional image codecs emphasize signal fidelity and human perception, often at the expense of machine vision tasks. Deep learning methods have demonstrated promising coding performance by utilizing rich semantic embeddings optimized for both human and machine vision. However, these compact embeddings struggle to capture fine details such as contours and textures, resulting in imperfect reconstructions. Furthermore, existing learning-based codecs lack scalability. To address these limitations, this paper introduces a content-adaptive diffusion model for scalable image compression. The proposed method encodes fine textures through a diffusion process, enhancing perceptual quality while preserving essential features for machine vision tasks. The approach employs a Markov palette diffusion model combined with widely used feature extractors and image generators, enabling efficient data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
