Direct-PoseNet: Absolute Pose Regression with Photometric Consistency
Shuai Chen, Zirui Wang, Victor Prisacariu

TL;DR
This paper introduces Direct-PoseNet, a novel absolute pose regression network that uses photometric consistency and differentiable rendering to improve localization accuracy while maintaining fast inference.
Contribution
It proposes a direct matching module with photometric supervision and adopts SO(3) rotation representation, enhancing accuracy and simplifying training compared to prior APR methods.
Findings
Achieves state-of-the-art results on 7-Scenes and LLFF datasets.
Maintains low inference time with improved accuracy.
Outperforms existing single-image APR methods.
Abstract
We present a relocalization pipeline, which combines an absolute pose regression (APR) network with a novel view synthesis based direct matching module, offering superior accuracy while maintaining low inference time. Our contribution is twofold: i) we design a direct matching module that supplies a photometric supervision signal to refine the pose regression network via differentiable rendering; ii) we modify the rotation representation from the classical quaternion to SO(3) in pose regression, removing the need for balancing rotation and translation loss terms. As a result, our network Direct-PoseNet achieves state-of-the-art performance among all other single-image APR methods on the 7-Scenes benchmark and the LLFF dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Robotics and Sensor-Based Localization
