Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving   Applications

Qing Cheng; Niclas Zeller; Daniel Cremers

arXiv:2203.01087·cs.CV·March 3, 2022

Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications

Qing Cheng, Niclas Zeller, Daniel Cremers

PDF

Open Access

TL;DR

This paper introduces a stereo camera-based pipeline for large-scale 3D semantic mapping in autonomous driving, integrating visual odometry, global optimization, and semantic labeling, validated on extensive datasets.

Contribution

It presents a novel pipeline combining visual odometry, GNSS integration, and semantic labeling with a temporal voting scheme for improved accuracy.

Findings

01

Effective semantic labeling with a voting scheme

02

Successful large-scale mapping over 8000 km

03

Validated on KITTI-360 dataset

Abstract

In this paper, we present a complete pipeline for 3D semantic mapping solely based on a stereo camera system. The pipeline comprises a direct sparse visual odometry front-end as well as a back-end for global optimization including GNSS integration, and semantic 3D point cloud labeling. We propose a simple but effective temporal voting scheme which improves the quality and consistency of the 3D point labels. Qualitative and quantitative evaluations of our pipeline are performed on the KITTI-360 dataset. The results show the effectiveness of our proposed voting scheme and the capability of our pipeline for efficient large-scale 3D semantic mapping. The large-scale mapping capabilities of our pipeline is furthermore demonstrated by presenting a very large-scale semantic map covering 8000 km of roads generated from data collected by a fleet of vehicles.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage