Toward Scalable Image Feature Compression: A Content-Adaptive and   Diffusion-Based Approach

Sha Guo; Zhuo Chen; Yang Zhao; Ning Zhang; Xiaotong Li; Lingyu Duan

arXiv:2410.06149·cs.CV·October 10, 2024

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

Sha Guo, Zhuo Chen, Yang Zhao, Ning Zhang, Xiaotong Li, Lingyu Duan

PDF

TL;DR

This paper presents a scalable, content-adaptive diffusion-based image compression method that improves perceptual quality and preserves features for machine vision tasks, outperforming existing codecs without retraining.

Contribution

Introduces a novel diffusion-based framework for scalable image compression that captures textures and semantics adaptively, enabling flexible compression control and better downstream task performance.

Findings

01

Enhanced perceptual quality over state-of-the-art methods

02

Effective preservation of textures and semantics for machine vision

03

Flexible control over compression ratios without retraining

Abstract

Traditional image codecs emphasize signal fidelity and human perception, often at the expense of machine vision tasks. Deep learning methods have demonstrated promising coding performance by utilizing rich semantic embeddings optimized for both human and machine vision. However, these compact embeddings struggle to capture fine details such as contours and textures, resulting in imperfect reconstructions. Furthermore, existing learning-based codecs lack scalability. To address these limitations, this paper introduces a content-adaptive diffusion model for scalable image compression. The proposed method encodes fine textures through a diffusion process, enhancing perceptual quality while preserving essential features for machine vision tasks. The approach employs a Markov palette diffusion model combined with widely used feature extractors and image generators, enabling efficient data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion