Window-to-Window BEV Representation Learning for Limited FoV Cross-View   Geo-localization

Lei Cheng; Teng Wang; Lingquan Meng; Changyin Sun

arXiv:2407.06861·cs.CV·July 10, 2024

Window-to-Window BEV Representation Learning for Limited FoV Cross-View Geo-localization

Lei Cheng, Teng Wang, Lingquan Meng, Changyin Sun

PDF

Open Access

TL;DR

This paper introduces W2W-BEV, a novel method for cross-view geo-localization that learns BEV representations directly from ground images with limited field of view and unknown orientation, significantly improving accuracy.

Contribution

The paper proposes a window-to-window BEV learning approach that adaptively matches BEV queries to ground references, addressing orientation ambiguity and FoV limitations.

Findings

01

W2W-BEV outperforms previous methods on benchmark datasets.

02

Achieves 64.73% R@1 accuracy on CVUSA with limited FoV.

03

Demonstrates robustness under unknown orientation conditions.

Abstract

Cross-view geo-localization confronts significant challenges due to large perspective changes, especially when the ground-view query image has a limited field of view with unknown orientation. To bridge the cross-view domain gap, we for the first time explore to learn a BEV representation directly from the ground query image. However, the unknown orientation between ground and aerial images combined with the absence of camera parameters led to ambiguity between BEV queries and ground references. To tackle this challenge, we propose a novel Window-to-Window BEV representation learning method, termed W2W-BEV, which adaptively matches BEV queries to ground reference at window-scale. Specifically, predefined BEV embeddings and extracted ground features are segmented into a fixed number of windows, and then most similar ground window is chosen for each BEV feature based on the context-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Robotics and Sensor-Based Localization