PICD: Versatile Perceptual Image Compression with Diffusion Rendering
Tongda Xu, Jiahao Li, Bin Li, Yan Wang, Ya-Qin Zhang, Yan Lu

TL;DR
PICD introduces a versatile perceptual image compression method that uses diffusion rendering to effectively compress both screen and natural images, especially improving text preservation in screen content.
Contribution
The paper presents a novel compression framework that encodes text and images separately and employs diffusion models with multi-level conditioning for high-quality perceptual reconstruction.
Findings
Outperforms existing perceptual codecs in text accuracy and visual quality
Effective as a perceptual codec for natural images without text conditions
Utilizes a multi-level diffusion model conditioning strategy
Abstract
Recently, perceptual image compression has achieved significant advancements, delivering high visual quality at low bitrates for natural images. However, for screen content, existing methods often produce noticeable artifacts when compressing text. To tackle this challenge, we propose versatile perceptual screen image compression with diffusion rendering (PICD), a codec that works well for both screen and natural images. More specifically, we propose a compression framework that encodes the text and image separately, and renders them into one image using diffusion model. For this diffusion rendering, we integrate conditional information into diffusion models at three distinct levels: 1). Domain level: We fine-tune the base diffusion model using text content prompts with screen content. 2). Adaptor level: We develop an efficient adaptor to control the diffusion model using compressed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Data Compression Techniques · Video Coding and Compression Technologies
MethodsDiffusion · Balanced Selection
