SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving
Tiago Cortinhal, George Tzelepis, Eren Erdal Aksoy

TL;DR
SalsaNext is a real-time, uncertainty-aware 3D LiDAR semantic segmentation network that improves accuracy and efficiency over previous models, utilizing advanced residual dilated convolutions, a novel decoder, and Bayesian uncertainty estimation.
Contribution
The paper introduces SalsaNext, a novel architecture with residual dilated convolutions and Bayesian uncertainty modeling, achieving state-of-the-art results on Semantic-KITTI.
Findings
Outperforms existing semantic segmentation networks on Semantic-KITTI
Ranks first on the Semantic-KITTI leaderboard
Provides reliable uncertainty estimates for each point
Abstract
In this paper, we introduce SalsaNext for the uncertainty-aware semantic segmentation of a full 3D LiDAR point cloud in real-time. SalsaNext is the next version of SalsaNet [1] which has an encoder-decoder architecture where the encoder unit has a set of ResNet blocks and the decoder part combines upsampled features from the residual blocks. In contrast to SalsaNet, we introduce a new context module, replace the ResNet encoder blocks with a new residual dilated convolution stack with gradually increasing receptive fields and add the pixel-shuffle layer in the decoder. Additionally, we switch from stride convolution to average pooling and also apply central dropout treatment. To directly optimize the Jaccard index, we further combine the weighted cross-entropy loss with Lovasz-Softmax loss [2]. We finally inject a Bayesian treatment to compute the epistemic and aleatoric uncertainties…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Remote Sensing and LiDAR Applications
MethodsDilated Convolution · Lovasz-Softmax · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling
