Semantic Similarity Score for Measuring Visual Similarity at Semantic   Level

Senran Fan; Zhicheng Bao; Chen Dong; Haotai Liang; Xiaodong Xu; Ping; Zhang

arXiv:2406.03865·cs.CV·July 11, 2024

Semantic Similarity Score for Measuring Visual Similarity at Semantic Level

Senran Fan, Zhicheng Bao, Chen Dong, Haotai Liang, Xiaodong Xu, Ping, Zhang

PDF

Open Access

TL;DR

This paper introduces SeSS, a semantic similarity metric based on scene graph matching, to evaluate semantic-level image similarity, addressing limitations of traditional pixel-based metrics in semantic communication systems.

Contribution

The paper proposes a novel semantic similarity score, SeSS, utilizing scene graph generation and matching, with fine-tuning based on human-annotated data to better reflect semantic perception.

Findings

01

SeSS effectively measures semantic differences in images transmitted by various communication systems.

02

SeSS correlates well with human semantic perception in diverse scenarios.

03

The metric outperforms traditional pixel-based similarity measures in semantic evaluation.

Abstract

Semantic communication, as a revolutionary communication architecture, is considered a promising novel communication paradigm. Unlike traditional symbol-based error-free communication systems, semantic-based visual communication systems extract, compress, transmit, and reconstruct images at the semantic level. However, widely used image similarity evaluation metrics, whether pixel-based MSE or PSNR or structure-based MS-SSIM, struggle to accurately measure the loss of semantic-level information of the source during system transmission. This presents challenges in evaluating the performance of visual semantic communication systems, especially when comparing them with traditional communication systems. To address this, we propose a semantic evaluation metric -- SeSS (Semantic Similarity Score), based on Scene Graph Generation and graph matching, which shifts the similarity scores between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques