LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception
Dongqiangzi Ye, Zixiang Zhou, Weijia Chen, Yufei Xie, Yu Wang, Panqu, Wang, Hassan Foroosh

TL;DR
LidarMultiNet is a unified, end-to-end multi-task network for LiDAR perception that achieves state-of-the-art results across detection, segmentation, and panoptic segmentation tasks by sharing a strong encoder-decoder architecture and task-specific heads.
Contribution
The paper introduces LidarMultiNet, a novel multi-task LiDAR perception network that unifies three major tasks with competitive performance and minimal additional cost, outperforming specialized single-task models.
Findings
Achieves state-of-the-art results on Waymo and nuScenes datasets.
Wins first place in the Waymo 3D semantic segmentation challenge 2022.
Sets new benchmarks for LiDAR-based object detection and segmentation.
Abstract
LiDAR-based 3D object detection, semantic segmentation, and panoptic segmentation are usually implemented in specialized networks with distinctive architectures that are difficult to adapt to each other. This paper presents LidarMultiNet, a LiDAR-based multi-task network that unifies these three major LiDAR perception tasks. Among its many benefits, a multi-task network can reduce the overall cost by sharing weights and computation among multiple tasks. However, it typically underperforms compared to independently combined single-task models. The proposed LidarMultiNet aims to bridge the performance gap between the multi-task network and multiple single-task networks. At the core of LidarMultiNet is a strong 3D voxel-based encoder-decoder architecture with a Global Context Pooling (GCP) module extracting global contextual features from a LiDAR frame. Task-specific heads are added on top…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsTest
