Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via   Geometry-Guided Cross-View Transformer

Yujiao Shi; Fei Wu; Akhil Perincherry; Ankit Vora; and Hongdong Li

arXiv:2307.08015·cs.CV·July 21, 2023

Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer

Yujiao Shi, Fei Wu, Akhil Perincherry, Ankit Vora, and Hongdong Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a geometry-guided cross-view transformer that significantly enhances ground-to-satellite camera localization accuracy by effectively estimating relative rotation and translation, outperforming existing methods on benchmark datasets.

Contribution

It proposes a novel geometry-guided cross-view transformer and neural pose optimizer for precise localization, integrating geometric and learnable features for improved accuracy.

Findings

01

Likelihood of lateral pose within 1m increased from 35.54% to 76.44%.

02

Orientation within 1° improved from 19.64% to 99.10%.

03

Method outperforms state-of-the-art on KITTI dataset.

Abstract

Image retrieval-based cross-view localization methods often lead to very coarse camera pose estimation, due to the limited sampling density of the database satellite images. In this paper, we propose a method to increase the accuracy of a ground camera's location and orientation by estimating the relative rotation and translation between the ground-level image and its matched/retrieved satellite image. Our approach designs a geometry-guided cross-view transformer that combines the benefits of conventional geometry and learnable cross-view transformers to map the ground-view observations to an overhead view. Given the synthesized overhead view and observed satellite feature maps, we construct a neural pose optimizer with strong global information embedding ability to estimate the relative rotation between them. After aligning their rotations, we develop an uncertainty-guided spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shiyujiao/boosting3dofaccuracy
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging