FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding

Pengxiang Wu; Siman Wang; Kevin Dela Rosa; Derek Hao Hu

arXiv:2309.16249·cs.CV·September 29, 2023·1 cites

FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding

Pengxiang Wu, Siman Wang, Kevin Dela Rosa, Derek Hao Hu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces FORB, a new benchmark dataset for flat object image retrieval, addressing the limitations of existing datasets by evaluating retrieval methods on diverse 2D flat objects and out-of-distribution domains.

Contribution

The paper presents a novel flat object retrieval benchmark (FORB) that expands evaluation beyond 3D landmarks to include diverse 2D flat objects, facilitating better assessment of image embedding quality.

Findings

01

Retrieval accuracy varies significantly across different methods.

02

Matching score margin provides additional insights into retrieval performance.

03

The benchmark reveals challenges and heterogeneity in flat object retrieval.

Abstract

Image retrieval is a fundamental task in computer vision. Despite recent advances in this field, many techniques have been evaluated on a limited number of domains, with a small number of instance categories. Notably, most existing works only consider domains like 3D landmarks, making it difficult to generalize the conclusions made by these works to other domains, e.g., logo and other 2D flat objects. To bridge this gap, we introduce a new dataset for benchmarking visual search methods on flat images with diverse patterns. Our flat object retrieval benchmark (FORB) supplements the commonly adopted 3D object domain, and more importantly, it serves as a testbed for assessing the image embedding quality on out-of-distribution domains. In this benchmark we investigate the retrieval accuracy of representative methods in terms of candidate ranks, as well as matching score margin, a viewpoint…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pxiangwu/forb
pytorchOfficial

Videos

FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding· slideslive

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning