Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation
Kejia Liu, Haoyang Zhou, Ruoyu Xu, Peicheng Wang, Mingli Song, Haofei Zhang

TL;DR
Bearing-UAV introduces a vision-only cross-view navigation approach that jointly predicts UAV location and heading, improving robustness and accuracy over existing methods, and provides a new benchmark dataset for evaluation.
Contribution
The paper proposes Bearing-UAV, a novel method for cross-view UAV navigation that encodes spatial relationships and jointly predicts location and heading, enhancing robustness and efficiency.
Findings
Bearing-UAV achieves lower localization error than previous methods.
The approach is robust to cross-view variations and feature-sparse conditions.
The new benchmark dataset facilitates evaluation of cross-view localization methods.
Abstract
Recent advances in cross-view geo-localization (CVGL) methods have shown strong potential for supporting unmanned aerial vehicle (UAV) navigation in GNSS-denied environments. However, existing work predominantly focuses on matching UAV views to onboard map tiles, which introduces an inherent trade-off between accuracy and storage overhead, and overlooks the importance of the UAV's heading during navigation. Moreover, the substantial discrepancies and varying overlaps in cross-view scenarios have been insufficiently considered, limiting their generalization to real-world scenarios. In this paper, we present Bearing-UAV, a purely vision-driven cross-view navigation method that jointly predicts UAV absolute location and heading from neighboring features, enabling accurate, lightweight, and robust navigation in the wild. Our method leverages global and local structural features and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
