Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding
Jianhao Huang, Qunsong Zeng, Kaibin Huang

TL;DR
This paper introduces a hybrid generative semantic communication system with critical information embedding and a new visual fidelity metric, improving image reconstruction quality and system adaptability in 6G networks.
Contribution
It proposes a novel semantic filtering approach for critical feature extraction and a new GVIF metric for visual quality evaluation, enhancing Gen-SemCom performance.
Findings
GVIF correlates well with PSNR and FID scores.
The system achieves higher PSNR and lower FID than benchmarks.
Adaptive control of features improves visual fidelity.
Abstract
Generative semantic communication (Gen-SemCom) with large artificial intelligence (AI) model promises a transformative paradigm for 6G networks, which reduces communication costs by transmitting low-dimensional prompts rather than raw data. However, purely prompt-driven generation loses fine-grained visual details. Additionally, there is a lack of systematic metrics to evaluate the performance of Gen-SemCom systems. To address these issues, we develop a hybrid Gen-SemCom system with a critical information embedding (CIE) framework, where both text prompts and semantically critical features are extracted for transmissions. First, a novel approach of semantic filtering is proposed to select and transmit the semantically critical features of images relevant to semantic label. By integrating the text prompt and critical features, the receiver reconstructs high-fidelity images using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Computing and Networks
