CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud
Evgeny Yudin, Pavel Karpyshev, Mikhail Kurenkov, Alena Savinykh,, Andrei Potapov, Evgeny Kruzhkov, and Dzmitry Tsetserukou

TL;DR
This paper introduces CloudVision, a visual localization method that leverages prebuilt LiDAR maps and camera data to achieve centimeter-level accuracy in robot positioning, enabling high-precision autonomous navigation.
Contribution
The paper presents a novel approach combining LiDAR-based mapping with camera localization, reducing the need for expensive sensors on operational robots.
Findings
Achieved median translation error of 1.3 cm.
Enabled high-precision localization using only RGB cameras.
Validated on a custom dataset from Skoltech.
Abstract
In this study, we propose a novel visual localization approach to accurately estimate six degrees of freedom (6-DoF) poses of the robot within the 3D LiDAR map based on visual data from an RGB camera. The 3D map is obtained utilizing an advanced LiDAR-based simultaneous localization and mapping (SLAM) algorithm capable of collecting a precise sparse map. The features extracted from the camera images are compared with the points of the 3D map, and then the geometric optimization problem is being solved to achieve precise visual localization. Our approach allows employing a scout robot equipped with an expensive LiDAR only once - for mapping of the environment, and multiple operational robots with only RGB cameras onboard - for performing mission tasks, with the localization accuracy higher than common camera-based solutions. The proposed method was tested on the custom dataset collected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage
