LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception

Dongqiangzi Ye; Zixiang Zhou; Weijia Chen; Yufei Xie; Yu Wang; Panqu; Wang; Hassan Foroosh

arXiv:2209.09385·cs.CV·March 23, 2023·6 cites

LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception

Dongqiangzi Ye, Zixiang Zhou, Weijia Chen, Yufei Xie, Yu Wang, Panqu, Wang, Hassan Foroosh

PDF

Open Access 1 Video

TL;DR

LidarMultiNet is a unified, end-to-end multi-task network for LiDAR perception that achieves state-of-the-art results across detection, segmentation, and panoptic segmentation tasks by sharing a strong encoder-decoder architecture and task-specific heads.

Contribution

The paper introduces LidarMultiNet, a novel multi-task LiDAR perception network that unifies three major tasks with competitive performance and minimal additional cost, outperforming specialized single-task models.

Findings

01

Achieves state-of-the-art results on Waymo and nuScenes datasets.

02

Wins first place in the Waymo 3D semantic segmentation challenge 2022.

03

Sets new benchmarks for LiDAR-based object detection and segmentation.

Abstract

LiDAR-based 3D object detection, semantic segmentation, and panoptic segmentation are usually implemented in specialized networks with distinctive architectures that are difficult to adapt to each other. This paper presents LidarMultiNet, a LiDAR-based multi-task network that unifies these three major LiDAR perception tasks. Among its many benefits, a multi-task network can reduce the overall cost by sharing weights and computation among multiple tasks. However, it typically underperforms compared to independently combined single-task models. The proposed LidarMultiNet aims to bridge the performance gap between the multi-task network and multiple single-task networks. At the core of LidarMultiNet is a strong 3D voxel-based encoder-decoder architecture with a Global Context Pooling (GCP) module extracting global contextual features from a LiDAR frame. Task-specific heads are added on top…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LidarMultiNet: Towards a Unified Multi-task Network for LiDAR Perception· underline

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsTest