Boosting LiDAR-based Semantic Labeling by Cross-Modal Training Data Generation
Florian Piewak, Peter Pinggera, Manuel Sch\"afer, David Peter, Beate, Schwarz, Nick Schneider, David Pfeiffer, Markus Enzweiler, and Marius, Z\"ollner

TL;DR
This paper introduces LiLaNet, a neural network for LiDAR semantic labeling that leverages cross-modal data generation to improve accuracy while reducing manual annotation efforts.
Contribution
The paper presents a novel neural network architecture called LiLaNet and an automated cross-modal data generation process named Autolabeling, enhancing LiDAR semantic segmentation performance.
Findings
LiLaNet outperforms existing CNN architectures on LiDAR data.
Automated data generation boosts segmentation accuracy by up to 14%.
Large-scale training data improves model robustness.
Abstract
Mobile robots and autonomous vehicles rely on multi-modal sensor setups to perceive and understand their surroundings. Aside from cameras, LiDAR sensors represent a central component of state-of-the-art perception systems. In addition to accurate spatial perception, a comprehensive semantic understanding of the environment is essential for efficient and safe operation. In this paper we present a novel deep neural network architecture called LiLaNet for point-wise, multi-class semantic labeling of semi-dense LiDAR data. The network utilizes virtual image projections of the 3D point clouds for efficient inference. Further, we propose an automated process for large-scale cross-modal training data generation called Autolabeling, in order to boost semantic labeling performance while keeping the manual annotation effort low. The effectiveness of the proposed network architecture as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
