Semantic Similarity Score for Measuring Visual Similarity at Semantic Level
Senran Fan, Zhicheng Bao, Chen Dong, Haotai Liang, Xiaodong Xu, Ping, Zhang

TL;DR
This paper introduces SeSS, a semantic similarity metric based on scene graph matching, to evaluate semantic-level image similarity, addressing limitations of traditional pixel-based metrics in semantic communication systems.
Contribution
The paper proposes a novel semantic similarity score, SeSS, utilizing scene graph generation and matching, with fine-tuning based on human-annotated data to better reflect semantic perception.
Findings
SeSS effectively measures semantic differences in images transmitted by various communication systems.
SeSS correlates well with human semantic perception in diverse scenarios.
The metric outperforms traditional pixel-based similarity measures in semantic evaluation.
Abstract
Semantic communication, as a revolutionary communication architecture, is considered a promising novel communication paradigm. Unlike traditional symbol-based error-free communication systems, semantic-based visual communication systems extract, compress, transmit, and reconstruct images at the semantic level. However, widely used image similarity evaluation metrics, whether pixel-based MSE or PSNR or structure-based MS-SSIM, struggle to accurately measure the loss of semantic-level information of the source during system transmission. This presents challenges in evaluating the performance of visual semantic communication systems, especially when comparing them with traditional communication systems. To address this, we propose a semantic evaluation metric -- SeSS (Semantic Similarity Score), based on Scene Graph Generation and graph matching, which shifts the similarity scores between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques
