How to Evaluate Semantic Communications for Images with ViTScore Metric?
Tingting Zhu, Bo Peng, Jifan Liang, Tingchen Han, Hai Wan, Jingqiao, Fu, and Junjie Chen

TL;DR
This paper introduces ViTScore, a novel metric inspired by NLP's BERTScore, designed to evaluate semantic similarity between images, addressing limitations of traditional pixel-based metrics in semantic communication systems.
Contribution
The paper proposes ViTScore, a new image semantic similarity metric with proven properties, and demonstrates its effectiveness over existing metrics in various semantic communication scenarios.
Findings
ViTScore outperforms PSNR, MS-SSIM, and LPIPS in semantic similarity evaluation.
ViTScore is robust against semantic attacks like GAN-based image inversions.
The metric is theoretically justified with properties like symmetry, boundedness, and normalization.
Abstract
Semantic communications (SC) have been expected to be a new paradigm shifting to catalyze the next generation communication, whose main concerns shift from accurate bit transmission to effective semantic information exchange in communications. However, the previous and widely-used metrics for images are not applicable to evaluate the image semantic similarity in SC. Classical metrics to measure the similarity between two images usually rely on the pixel level or the structural level, such as the PSNR and the MS-SSIM. Straightforwardly using some tailored metrics based on deep-learning methods in CV community, such as the LPIPS, is infeasible for SC. To tackle this, inspired by BERTScore in NLP community, we propose a novel metric for evaluating image semantic similarity, named Vision Transformer Score (ViTScore). We prove theoretically that ViTScore has 3 important properties, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Misinformation and Its Impacts · Domain Adaptation and Few-Shot Learning
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer
