CorrEmbed: Evaluating Pre-trained Model Image Similarity Efficacy with a   Novel Metric

Karl Audun Kagnes Borgersen; Morten Goodwin; Jivitesh Sharma; Tobias; Aasmoe; Mari Leonhardsen; Gro Herredsvela R{\o}rvik

arXiv:2308.16126·cs.CV·August 31, 2023

CorrEmbed: Evaluating Pre-trained Model Image Similarity Efficacy with a Novel Metric

Karl Audun Kagnes Borgersen, Morten Goodwin, Jivitesh Sharma, Tobias, Aasmoe, Mari Leonhardsen, Gro Herredsvela R{\o}rvik

PDF

Open Access 1 Repo

TL;DR

CorrEmbed introduces a novel metric to evaluate pre-trained image embeddings by correlating embedding distances with human-generated tag similarities, revealing insights into model performance for image similarity tasks.

Contribution

The paper proposes CorrEmbed, a new evaluation method that assesses pre-trained models' image embeddings based on their alignment with human tag similarities, addressing a gap in existing metrics.

Findings

01

Linear relationship between ImageNet accuracy and tag-correlation scores

02

CorrEmbed identifies models that deviate from expected performance patterns

03

Provides insights into how models capture high-level image features

Abstract

Detecting visually similar images is a particularly useful attribute to look to when calculating product recommendations. Embedding similarity, which utilizes pre-trained computer vision models to extract high-level image features, has demonstrated remarkable efficacy in identifying images with similar compositions. However, there is a lack of methods for evaluating the embeddings generated by these models, as conventional loss and performance metrics do not adequately capture their performance in image similarity search tasks. In this paper, we evaluate the viability of the image embeddings from numerous pre-trained computer vision models using a novel approach named CorrEmbed. Our approach computes the correlation between distances in image embeddings and distances in human-generated tag vectors. We extensively evaluate numerous pre-trained Torchvision models using this metric,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cair/CorrEmbed_Evaluating_Pre-trained_Model_Efficacy
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Industrial Vision Systems and Defect Detection · Visual Attention and Saliency Detection