Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding

Jianhao Huang; Qunsong Zeng; Kaibin Huang

arXiv:2505.10405·eess.IV·May 16, 2025

TL;DR

This paper introduces a hybrid generative semantic communication system with critical information embedding and a new visual fidelity metric, improving image reconstruction quality and system adaptability in 6G networks.

Contribution

It proposes a novel semantic filtering approach for critical feature extraction and a new GVIF metric for visual quality evaluation, enhancing Gen-SemCom performance.

Findings

01 GVIF correlates well with PSNR and FID scores.

02 The system achieves higher PSNR and lower FID than benchmarks.

03 Adaptive control of features improves visual fidelity.
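
For readers unfamiliar with PSNR, one of the two baseline metrics named in the findings, the following is a minimal sketch of its standard definition, 10 log10(MAX^2 / MSE). It is not the paper's evaluation code and it does not implement GVIF or FID. The function name `psnr`, the flat-list pixel representation, and the default peak value of 255 (8-bit images) are assumptions made for illustration.

```python
import math


def psnr(reference, reconstructed, max_value=255.0):
    """Return the peak signal-to-noise ratio in dB between two pixel sequences.

    Assumption: both images are given as flat sequences of pixel intensities
    of equal length, with `max_value` as the largest possible intensity.
    """
    if len(reference) != len(reconstructed) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    # Mean squared error between the two images.
    mse = sum((a - b) ** 2 for a, b in zip(reference, reconstructed)) / len(reference)

    # Identical images have zero error, so PSNR is unbounded.
    if mse == 0:
        return math.inf

    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    original = [100, 100, 100, 100]
    received = [110, 110, 110, 110]
    print(f"PSNR: {psnr(original, received):.2f} dB")
```

A higher PSNR means the reconstruction is closer to the reference pixel by pixel, whereas FID compares feature statistics of image sets, so lower FID is better.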

Abstract

Generative semantic communication (Gen-SemCom) with large artificial intelligence (AI) models promises a transformative paradigm for 6G networks, reducing communication costs by transmitting low-dimensional prompts rather than raw data. However, purely prompt-driven generation loses fine-grained visual details. Additionally, there is a lack of systematic metrics for evaluating the performance of Gen-SemCom systems. To address these issues, we develop a hybrid Gen-SemCom system with a critical information embedding (CIE) framework, in which both text prompts and semantically critical features are extracted for transmission. First, a novel semantic filtering approach is proposed to select and transmit the semantically critical features of images that are relevant to the semantic label. By integrating the text prompt and critical features, the receiver reconstructs high-fidelity images using a…

Taxonomy

Topics: Cognitive Computing and Networks