S3Net: 3D LiDAR Sparse Semantic Segmentation Network
Ran Cheng, Ryan Razani, Yuan Ren, Liu Bingbing

TL;DR
S3Net is a novel 3D LiDAR semantic segmentation network that leverages sparse attention modules and a geo-aware loss to improve accuracy and boundary delineation in autonomous driving scenarios.
Contribution
The paper introduces S3Net, which employs sparse attention modules and a geo-aware loss, offering a new effective approach for 3D LiDAR semantic segmentation.
Findings
Achieves 12% improvement over MinkNet42 baseline.
Attains state-of-the-art mIoU accuracy on SemanticKITTI.
Effectively emphasizes semantic boundaries and reduces noise.
Abstract
Semantic Segmentation is a crucial component in the perception systems of many applications, such as robotics and autonomous driving that rely on accurate environmental perception and understanding. In literature, several approaches are introduced to attempt LiDAR semantic segmentation task, such as projection-based (range-view or birds-eye-view), and voxel-based approaches. However, they either abandon the valuable 3D topology and geometric relations and suffer from information loss introduced in the projection process or are inefficient. Therefore, there is a need for accurate models capable of processing the 3D driving-scene point cloud in 3D space. In this paper, we propose S3Net, a novel convolutional neural network for LiDAR point cloud semantic segmentation. It adopts an encoder-decoder backbone that consists of Sparse Intra-channel Attention Module (SIntraAM), and Sparse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · Advanced Vision and Imaging
MethodsConvolution
