Google Landmarks Dataset v2 -- A Large-Scale Benchmark for   Instance-Level Recognition and Retrieval

Tobias Weyand; Andre Araujo; Bingyi Cao; Jack Sim

arXiv:2004.01804·cs.CV·November 3, 2020·28 cites

Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval

Tobias Weyand, Andre Araujo, Bingyi Cao, Jack Sim

PDF

Open Access 5 Repos 1 Models 1 Datasets 1 Video

TL;DR

Google Landmarks Dataset v2 is a large-scale, challenging benchmark for fine-grained landmark recognition and retrieval, featuring over 5 million images with diverse, real-world variability to advance research in this domain.

Contribution

The paper introduces the largest landmark dataset to date, with challenging real-world properties, and provides baseline results and evaluation tools for recognition and retrieval tasks.

Findings

01

Baseline methods achieve competitive results on the dataset.

02

The dataset enables effective transfer learning for landmark recognition.

03

Real-world variability makes the dataset a challenging benchmark.

Abstract

While image retrieval and instance recognition techniques are progressing rapidly, there is a need for challenging datasets to accurately measure their performance -- while posing novel challenges that are relevant for practical applications. We introduce the Google Landmarks Dataset v2 (GLDv2), a new benchmark for large-scale, fine-grained instance recognition and image retrieval in the domain of human-made and natural landmarks. GLDv2 is the largest such dataset to date by a large margin, including over 5M images and 200k distinct instance labels. Its test set consists of 118k images with ground truth annotations for both the retrieval and recognition tasks. The ground truth construction involved over 800 hours of human annotator work. Our new dataset has several challenging properties inspired by real world applications that previous datasets did not consider: An extremely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
Shiro/DOLG
model

Datasets

larrygzlb1994/HolidayRePaint
dataset· 24 dl
24 dl

Videos

Google Landmarks Dataset v2 – A Large-Scale Benchmark for Instance-Level Recognition and Retrieval· youtube

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Average Pooling · Residual Connection · Additive Angular Margin Loss · Max Pooling · Global Average Pooling · Bottleneck Residual Block · Residual Block