End-to-end learning of keypoint detection and matching for relative pose   estimation

Antoine Fond; Luca Del Pero; Nikola Sivacki; Marco Paladini

arXiv:2104.01085·cs.CV·April 5, 2021·1 cites

End-to-end learning of keypoint detection and matching for relative pose estimation

Antoine Fond, Luca Del Pero, Nikola Sivacki, Marco Paladini

PDF

Open Access

TL;DR

This paper introduces an end-to-end trainable system for keypoint detection, description, matching, and pose estimation, significantly improving visual localization accuracy for applications like AR and robotic mapping.

Contribution

It presents a novel end-to-end learning framework that integrates all steps of pose estimation, including feature matching, which was traditionally done separately.

Findings

01

Achieves state-of-the-art localization accuracy on 7 Scenes dataset.

02

Demonstrates robustness and real-time performance in visual localization tasks.

03

Unified learning approach simplifies the pipeline and improves overall accuracy.

Abstract

We propose a new method for estimating the relative pose between two images, where we jointly learn keypoint detection, description extraction, matching and robust pose estimation. While our architecture follows the traditional pipeline for pose estimation from geometric computer vision, all steps are learnt in an end-to-end fashion, including feature matching. We demonstrate our method for the task of visual localization of a query image within a database of images with known pose. Pairwise pose estimation has many practical applications for robotic mapping, navigation, and AR. For example, the display of persistent AR objects in the scene relies on a precise camera localization to make the digital models appear anchored to the physical environment. We train our pipeline end-to-end specifically for the problem of visual localization. We evaluate our proposed approach on localization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques