LiDAR-BEVMTN: Real-Time LiDAR Bird's-Eye View Multi-Task Perception Network for Autonomous Driving
Sambit Mohapatra, Senthil Yogamani, Varun Ravi Kumar, Stefan Milz,, Heinrich Gotzig, Patrick M\"ader

TL;DR
This paper introduces a real-time, multi-task LiDAR perception network for autonomous driving that unifies detection, segmentation, and motion estimation with high efficiency on embedded platforms.
Contribution
It presents a novel multi-task CNN architecture with a SWAG module and heterogeneous training, achieving state-of-the-art results and real-time performance on embedded hardware.
Findings
Achieves 3ms latency on NVIDIA Xavier platform.
State-of-the-art semantic and motion segmentation results.
Near state-of-the-art 3D object detection performance.
Abstract
LiDAR is crucial for robust 3D scene perception in autonomous driving. LiDAR perception has the largest body of literature after camera perception. However, multi-task learning across tasks like detection, segmentation, and motion estimation using LiDAR remains relatively unexplored, especially on automotive-grade embedded platforms. We present a real-time multi-task convolutional neural network for LiDAR-based object detection, semantics, and motion segmentation. The unified architecture comprises a shared encoder and task-specific decoders, enabling joint representation learning. We propose a novel Semantic Weighting and Guidance (SWAG) module to transfer semantic features for improved object detection selectively. Our heterogeneous training scheme combines diverse datasets and exploits complementary cues between tasks. The work provides the first embedded implementation unifying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Robotics and Sensor-Based Localization
