Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval
Tobias Weyand, Andre Araujo, Bingyi Cao, Jack Sim

TL;DR
Google Landmarks Dataset v2 is a large-scale, challenging benchmark for fine-grained landmark recognition and retrieval, featuring over 5 million images with diverse, real-world variability to advance research in this domain.
Contribution
The paper introduces the largest landmark dataset to date, with challenging real-world properties, and provides baseline results and evaluation tools for recognition and retrieval tasks.
Findings
Baseline methods achieve competitive results on the dataset.
The dataset enables effective transfer learning for landmark recognition.
Real-world variability makes the dataset a challenging benchmark.
Abstract
While image retrieval and instance recognition techniques are progressing rapidly, there is a need for challenging datasets to accurately measure their performance -- while posing novel challenges that are relevant for practical applications. We introduce the Google Landmarks Dataset v2 (GLDv2), a new benchmark for large-scale, fine-grained instance recognition and image retrieval in the domain of human-made and natural landmarks. GLDv2 is the largest such dataset to date by a large margin, including over 5M images and 200k distinct instance labels. Its test set consists of 118k images with ground truth annotations for both the retrieval and recognition tasks. The ground truth construction involved over 800 hours of human annotator work. Our new dataset has several challenging properties inspired by real world applications that previous datasets did not consider: An extremely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Google Landmarks Dataset v2 – A Large-Scale Benchmark for Instance-Level Recognition and Retrieval· youtube
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Average Pooling · Residual Connection · Additive Angular Margin Loss · Max Pooling · Global Average Pooling · Bottleneck Residual Block · Residual Block
