TL;DR
This paper introduces a novel cross-view geo-localization method that leverages drone-view images as a bridge to improve ground-to-satellite image retrieval accuracy, addressing large viewpoint variations and irrelevant backgrounds.
Contribution
It proposes a Peer Learning and Cross Diffusion framework utilizing drone-view data to enhance ground-to-satellite cross-view image retrieval, a novel approach in this domain.
Findings
Outperforms state-of-the-art methods on University-Earth and University-Google datasets.
Effectively handles large viewpoint changes and background clutter.
Demonstrates significant accuracy improvements in cross-view geo-localization.
Abstract
The large variation of viewpoint and irrelevant content around the target always hinder accurate image retrieval and its subsequent tasks. In this paper, we investigate an extremely challenging task: given a ground-view image of a landmark, we aim to achieve cross-view geo-localization by searching out its corresponding satellite-view images. Specifically, the challenge comes from the gap between ground-view and satellite-view, which includes not only large viewpoint changes (some parts of the landmark may be invisible from front view to top view) but also highly irrelevant background (the target landmark tend to be hidden in other surrounding buildings), making it difficult to learn a common representation or a suitable mapping. To address this issue, we take advantage of drone-view information as a bridge between ground-view and satellite-view domains. We propose a Peer Learning and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
