Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models
Pujing Yang, Guangyi Zhang, and Yunlong Cai

TL;DR
This paper introduces a rate-adaptive generative semantic communication system using conditional diffusion models, which improves perceptual quality of transmitted images by leveraging entropy-based bandwidth management and advanced attention mechanisms.
Contribution
The paper presents a novel generative DJSCC framework with entropy-guided transmission and a multi-stage training strategy, enhancing perceptual quality over existing methods.
Findings
Significantly better perceptual quality than existing approaches.
Effective bandwidth management via entropy models.
Fast inference enabled by MLLA skeleton.
Abstract
Recent advances in deep learning-based joint source-channel coding (DJSCC) have shown promise for end-to-end semantic image transmission. However, most existing schemes primarily focus on optimizing pixel-wise metrics, which often fail to align with human perception, leading to lower perceptual quality. In this letter, we propose a novel generative DJSCC approach using conditional diffusion models to enhance the perceptual quality of transmitted images. Specifically, by utilizing entropy models, we effectively manage transmission bandwidth based on the estimated entropy of transmitted sym-bols. These symbols are then used at the receiver as conditional information to guide a conditional diffusion decoder in image reconstruction. Our model is built upon the emerging advanced mamba-like linear attention (MLLA) skeleton, which excels in image processing tasks while also offering fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpinion Dynamics and Social Influence
MethodsSoftmax · Attention Is All You Need · Diffusion · Focus · ALIGN
