Efficient Discovery and Effective Evaluation of Visual Perceptual   Similarity: A Benchmark and Beyond

Oren Barkan; Tal Reiss; Jonathan Weill; Ori Katz; Roy Hirsch; Itzik; Malkiel; Noam Koenigstein

arXiv:2308.14753·cs.CV·August 29, 2023

Efficient Discovery and Effective Evaluation of Visual Perceptual Similarity: A Benchmark and Beyond

Oren Barkan, Tal Reiss, Jonathan Weill, Ori Katz, Roy Hirsch, Itzik, Malkiel, Noam Koenigstein

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces a large-scale fashion visual similarity benchmark with expert annotations, proposes a new efficient labeling method, and discusses evaluation metrics to improve the assessment of visual similarity models.

Contribution

It provides the first extensive fashion VSD dataset, a novel labeling procedure, and insights into evaluation metrics beyond proxy tasks.

Findings

01

Created a dataset with 110K expert-annotated pairs

02

Proposed an efficient labeling procedure applicable to other datasets

03

Analyzed limitations and biases of current evaluation metrics

Abstract

Visual similarities discovery (VSD) is an important task with broad e-commerce applications. Given an image of a certain object, the goal of VSD is to retrieve images of different objects with high perceptual visual similarity. Although being a highly addressed problem, the evaluation of proposed methods for VSD is often based on a proxy of an identification-retrieval task, evaluating the ability of a model to retrieve different images of the same object. We posit that evaluating VSD methods based on identification tasks is limited, and faithful evaluation must rely on expert annotations. In this paper, we introduce the first large-scale fashion visual similarity benchmark dataset, consisting of more than 110K expert-annotated image pairs. Besides this major contribution, we share insight from the challenges we faced while curating this dataset. Based on these insights, we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vsd-benchmark/vsd
pytorchOfficial

Datasets

vsd-benchmark/vsd-fashion
dataset· 207 dl
207 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

MethodsFocus