Deep Camera Pose Regression Using Pseudo-LiDAR
Ali Raza, Lazar Lolic, Shahmir Akhter, Alfonso Dela Cruz, Michael Liut

TL;DR
This paper demonstrates that using pseudo-LiDAR signals for camera pose regression significantly improves localization accuracy over traditional depth map methods, introducing a novel dual-stream neural network architecture called FusionLoc.
Contribution
The paper introduces FusionLoc, a dual-stream neural network that leverages pseudo-LiDAR for improved 6DOF camera pose estimation, outperforming existing RGB-D based methods.
Findings
FusionLoc outperforms RGB-D PoseNet by 0.33m and 4.35° on average.
Pseudo-LiDAR representations yield better localization accuracy than depth maps.
Using pseudo-LiDAR enhances large-scale camera localization systems.
Abstract
An accurate and robust large-scale localization system is an integral component for active areas of research such as autonomous vehicles and augmented reality. To this end, many learning algorithms have been proposed that predict 6DOF camera pose from RGB or RGB-D images. However, previous methods that incorporate depth typically treat the data the same way as RGB images, often adding depth maps as additional channels to RGB images and passing them through convolutional neural networks (CNNs). In this paper, we show that converting depth maps into pseudo-LiDAR signals, previously shown to be useful for 3D object detection, is a better representation for camera localization tasks by projecting point clouds that can accurately determine 6DOF camera pose. This is demonstrated by first comparing localization accuracies of a network operating exclusively on pseudo-LiDAR representations, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Image and Object Detection Techniques · Robot Manipulation and Learning
