VG-SSL: Benchmarking Self-supervised Representation Learning Approaches for Visual Geo-localization
Jiuhong Xiao, Gao Zhu, Giuseppe Loianno

TL;DR
This paper introduces VG-SSL, a benchmarking framework for self-supervised learning in visual geo-localization, demonstrating that contrastive and information maximization methods can match or outperform supervised approaches.
Contribution
It presents the first benchmarking study of SSL methods in visual geo-localization, with a novel geo-related pair strategy and extensive performance analysis.
Findings
Contrastive learning improves geo-specific representations.
Information maximization methods perform well in VG tasks.
SSL methods can match or surpass state-of-the-art VG techniques.
Abstract
Visual Geo-localization (VG) is a critical research area for identifying geo-locations from visual inputs, particularly in autonomous navigation for robotics and vehicles. Current VG methods often learn feature extractors from geo-labeled images to create dense, geographically relevant representations. Recent advances in Self-Supervised Learning (SSL) have demonstrated its capability to achieve performance on par with supervised techniques with unlabeled images. This study presents a novel VG-SSL framework, designed for versatile integration and benchmarking of diverse SSL methods for representation learning in VG, featuring a unique geo-related pair strategy, GeoPair. Through extensive performance analysis, we adapt SSL techniques to improve VG on datasets from hand-held and car-mounted cameras used in robotics and autonomous vehicles. Our results show that contrastive learning and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization
MethodsResidual Block · Residual Connection · 1x1 Convolution · Batch Normalization · Color Jitter · Kaiming Initialization · Dense Connections · Random Resized Crop · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block
