Without Paired Labeled Data: End-to-End Self-Supervised Learning for Drone-view Geo-Localization
Zhongwei Chen, Zhao-Xu Yang, Hai-Jun Rong, Guoqi Li

TL;DR
This paper introduces a novel self-supervised learning approach for drone-view geo-localization that does not require paired labeled data, using clustering, contrastive learning, and memory modules to improve cross-view feature alignment.
Contribution
It proposes a new end-to-end self-supervised method with modules for dynamic memory and neighborhood information learning, surpassing existing supervised and self-supervised methods.
Findings
Outperforms existing self-supervised methods on benchmark datasets.
Surpasses several state-of-the-art supervised methods.
Demonstrates robustness without paired labeled data.
Abstract
Drone-view Geo-Localization (DVGL) aims to achieve accurate localization of drones by retrieving the most relevant GPS-tagged satellite images. However, most existing methods heavily rely on strictly pre-paired drone-satellite images for supervised learning. When the target region shifts, new paired samples are typically required to adapt to the distribution changes. The high cost of annotation and the limited transferability of these methods significantly hinder the practical deployment of DVGL in open-world scenarios. To address these limitations, we propose a novel end-to-end self-supervised learning method with a shallow backbone network, called the dynamic memory-driven and neighborhood information learning (DMNIL) method. It employs a clustering algorithm to generate pseudo-labels and adopts a dual-path contrastive learning framework to learn discriminative intra-view…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Remote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage
MethodsContrastive Learning
