LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation   and Detection

Jiahua Xu; Si Zuo; Chenfeng Wei; Wei Zhou

arXiv:2406.07023·cs.CV·June 13, 2024·1 cites

LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection

Jiahua Xu, Si Zuo, Chenfeng Wei, Wei Zhou

PDF

Open Access

TL;DR

LiSD is a novel multi-task learning framework that efficiently combines lidar segmentation and detection tasks, achieving state-of-the-art results by leveraging hierarchical feature collaboration and cross-task information sharing.

Contribution

This work introduces LiSD, the first efficient multi-task framework for lidar segmentation and detection that integrates hierarchical and holistic modules for improved performance.

Findings

01

Achieves 83.3% mIoU on nuScenes segmentation benchmark.

02

Outperforms existing methods on nuScenes and Waymo datasets.

03

Effectively balances sparsity and density in feature integration.

Abstract

With the rapid proliferation of autonomous driving, there has been a heightened focus on the research of lidar-based 3D semantic segmentation and object detection methodologies, aiming to ensure the safety of traffic participants. In recent decades, learning-based approaches have emerged, demonstrating remarkable performance gains in comparison to conventional algorithms. However, the segmentation and detection tasks have traditionally been examined in isolation to achieve the best precision. To this end, we propose an efficient multi-task learning framework named LiSD which can address both segmentation and detection tasks, aiming to optimize the overall performance. Our proposed LiSD is a voxel-based encoder-decoder framework that contains a hierarchical feature collaboration module and a holistic information aggregation module. Different integration methods are adopted to keep…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications

MethodsFocus