CorrEmbed: Evaluating Pre-trained Model Image Similarity Efficacy with a Novel Metric
Karl Audun Kagnes Borgersen, Morten Goodwin, Jivitesh Sharma, Tobias, Aasmoe, Mari Leonhardsen, Gro Herredsvela R{\o}rvik

TL;DR
CorrEmbed introduces a novel metric to evaluate pre-trained image embeddings by correlating embedding distances with human-generated tag similarities, revealing insights into model performance for image similarity tasks.
Contribution
The paper proposes CorrEmbed, a new evaluation method that assesses pre-trained models' image embeddings based on their alignment with human tag similarities, addressing a gap in existing metrics.
Findings
Linear relationship between ImageNet accuracy and tag-correlation scores
CorrEmbed identifies models that deviate from expected performance patterns
Provides insights into how models capture high-level image features
Abstract
Detecting visually similar images is a particularly useful attribute to look to when calculating product recommendations. Embedding similarity, which utilizes pre-trained computer vision models to extract high-level image features, has demonstrated remarkable efficacy in identifying images with similar compositions. However, there is a lack of methods for evaluating the embeddings generated by these models, as conventional loss and performance metrics do not adequately capture their performance in image similarity search tasks. In this paper, we evaluate the viability of the image embeddings from numerous pre-trained computer vision models using a novel approach named CorrEmbed. Our approach computes the correlation between distances in image embeddings and distances in human-generated tag vectors. We extensively evaluate numerous pre-trained Torchvision models using this metric,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Industrial Vision Systems and Defect Detection · Visual Attention and Saliency Detection
