Semantic-Preserving Image Coding based on Conditional Diffusion Models
Francesco Pezone, Osman Musa, Giuseppe Caire, Sergio Barbarossa

TL;DR
This paper introduces SPIC, a semantic image coding method that uses conditional diffusion models to encode semantic maps and low-res images, achieving better semantic preservation and rate-distortion trade-offs.
Contribution
It presents a novel semantic image coding scheme leveraging conditional diffusion models to enhance semantic preservation in image transmission.
Findings
Outperforms state-of-the-art methods in semantic preservation
Achieves a better rate-distortion trade-off
Demonstrates effective reconstruction of high-resolution images
Abstract
Semantic communication, rather than on a bit-by-bit recovery of the transmitted messages, focuses on the meaning and the goal of the communication itself. In this paper, we propose a novel semantic image coding scheme that preserves the semantic content of an image, while ensuring a good trade-off between coding rate and image quality. The proposed Semantic-Preserving Image Coding based on Conditional Diffusion Models (SPIC) transmitter encodes a Semantic Segmentation Map (SSM) and a low-resolution version of the image to be transmitted. The receiver then reconstructs a high-resolution image using a Denoising Diffusion Probabilistic Models (DDPM) doubly conditioned to the SSM and the low-resolution image. As shown by the numerical examples, compared to state-of-the-art (SOTA) approaches, the proposed SPIC exhibits a better balance between the conventional rate-distortion trade-off and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Data Compression Techniques · Image and Signal Denoising Methods
