Window-to-Window BEV Representation Learning for Limited FoV Cross-View Geo-localization
Lei Cheng, Teng Wang, Lingquan Meng, Changyin Sun

TL;DR
This paper introduces W2W-BEV, a novel method for cross-view geo-localization that learns BEV representations directly from ground images with limited field of view and unknown orientation, significantly improving accuracy.
Contribution
The paper proposes a window-to-window BEV learning approach that adaptively matches BEV queries to ground references, addressing orientation ambiguity and FoV limitations.
Findings
W2W-BEV outperforms previous methods on benchmark datasets.
Achieves 64.73% R@1 accuracy on CVUSA with limited FoV.
Demonstrates robustness under unknown orientation conditions.
Abstract
Cross-view geo-localization confronts significant challenges due to large perspective changes, especially when the ground-view query image has a limited field of view with unknown orientation. To bridge the cross-view domain gap, we for the first time explore to learn a BEV representation directly from the ground query image. However, the unknown orientation between ground and aerial images combined with the absence of camera parameters led to ambiguity between BEV queries and ground references. To tackle this challenge, we propose a novel Window-to-Window BEV representation learning method, termed W2W-BEV, which adaptively matches BEV queries to ground reference at window-scale. Specifically, predefined BEV embeddings and extracted ground features are segmented into a fixed number of windows, and then most similar ground window is chosen for each BEV feature based on the context-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Robotics and Sensor-Based Localization
