Vision-based Semantic Mapping and Localization for Autonomous Indoor Parking
Yewei Huang, Junqiao Zhao, Xudong He, Shaoming Zhang and, Tiantian Feng

TL;DR
This paper presents a real-time, vision-based semantic mapping and localization system for autonomous indoor parking, utilizing high-level landmarks, data association optimization, and visual markers to achieve accurate vehicle positioning.
Contribution
It introduces a novel approach combining semantic parking slot mapping, data association refinement, and visual fiducial markers for improved indoor localization accuracy.
Findings
Achieved 0.3m average localization accuracy at 10kph.
Developed a fully automatic semantic parking lot mapping system.
Validated performance on the TiEV autonomous driving platform.
Abstract
In this paper, we proposed a novel and practical solution for the real-time indoor localization of autonomous driving in parking lots. High-level landmarks, the parking slots, are extracted and enriched with labels to avoid the aliasing of low-level visual features. We then proposed a robust method for detecting incorrect data associations between parking slots and further extended the optimization framework by dynamically eliminating suboptimal data associations. Visual fiducial markers are introduced to improve the overall precision. As a result, a semantic map of the parking lot can be established fully automatically and robustly. We experimented the performance of real-time localization based on the map using our autonomous driving platform TiEV, and the average accuracy of 0.3m track tracing can be achieved at a speed of 10kph.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Smart Parking Systems Research · Indoor and Outdoor Localization Technologies
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
