RePoseD: Efficient Relative Pose Estimation With Known Depth Information

Yaqing Ding; Viktor Kocur; V\'aclav V\'avra; Zuzana Berger Haladov\'a,; Jian Yang; Torsten Sattler; Zuzana Kukelova

arXiv:2501.07742·cs.CV·April 4, 2025

RePoseD: Efficient Relative Pose Estimation With Known Depth Information

Yaqing Ding, Viktor Kocur, V\'aclav V\'avra, Zuzana Berger Haladov\'a,, Jian Yang, Torsten Sattler, Zuzana Kukelova

PDF

Open Access 1 Repo

TL;DR

This paper introduces RePoseD, a novel framework that leverages monocular depth estimates for more accurate and efficient relative pose estimation between two cameras, outperforming existing methods.

Contribution

The paper presents new depth-aware solvers that jointly estimate scale and pose, improving speed and accuracy over prior approaches across various camera configurations.

Findings

01

Outperforms state-of-the-art depth-aware solvers in speed and accuracy.

02

Effective across multiple datasets and monocular depth estimation methods.

03

Provides guidelines on which solver to use in different scenarios.

Abstract

Recent advances in monocular depth estimation methods (MDE) and their improved accuracy open new possibilities for their applications. In this paper, we investigate how monocular depth estimates can be used for relative pose estimation. In particular, we are interested in answering the question whether using MDEs improves results over traditional point-based methods. We propose a novel framework for estimating the relative pose of two cameras from point correspondences with associated monocular depths. Since depth predictions are typically defined up to an unknown scale or even both unknown scale and shift parameters, our solvers jointly estimate the scale or both the scale and shift parameters along with the relative pose. We derive efficient solvers considering different types of depths for three camera configurations: (1) two calibrated cameras, (2) two cameras with an unknown shared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yaqding/pose_monodepth
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings