Sparse Semantic Map-Based Monocular Localization in Traffic Scenes Using Learned 2D-3D Point-Line Correspondences
Xingyu Chen, Jianru Xue, and Shanmin Pang

TL;DR
This paper introduces a robust monocular localization method for autonomous vehicles that uses sparse semantic maps and deep learning to improve pose estimation under occlusion and environmental changes.
Contribution
It presents a novel sparse semantic map-based approach that leverages 2D-3D point-line correspondences learned through deep neural networks for improved localization.
Findings
Outperforms state-of-the-art methods in accuracy
Robust against occlusion and appearance changes
Effective in diverse traffic scene conditions
Abstract
Vision-based localization in a prior map is of crucial importance for autonomous vehicles. Given a query image, the goal is to estimate the camera pose corresponding to the prior map, and the key is the registration problem of camera images within the map. While autonomous vehicles drive on the road under occlusion (e.g., car, bus, truck) and changing environment appearance (e.g., illumination changes, seasonal variation), existing approaches rely heavily on dense point descriptors at the feature level to solve the registration problem, entangling features with appearance and occlusion. As a result, they often fail to estimate the correct poses. To address these issues, we propose a sparse semantic map-based monocular localization method, which solves 2D-3D registration via a well-designed deep neural network. Given a sparse semantic map that consists of simplified elements (e.g., pole…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
