Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds
Yara Ali Alnaggar, Mohamed Afifi, Karim Amer, Mohamed Elhelw

TL;DR
This paper presents a Multi-Projection Fusion framework that combines spherical and bird's-eye view projections for efficient and accurate 3D LiDAR point cloud semantic segmentation, outperforming existing methods in accuracy and speed.
Contribution
It introduces a novel multi-projection fusion approach that leverages multiple views and efficient 2D models to improve segmentation accuracy and computational efficiency.
Findings
Achieved 55.5 mIoU on SemanticKITTI dataset.
Outperformed state-of-the-art projection-based methods in accuracy.
Operates 1.6x to 3.1x faster than comparable methods.
Abstract
Semantic segmentation of 3D point cloud data is essential for enhanced high-level perception in autonomous platforms. Furthermore, given the increasing deployment of LiDAR sensors onboard of cars and drones, a special emphasis is also placed on non-computationally intensive algorithms that operate on mobile GPUs. Previous efficient state-of-the-art methods relied on 2D spherical projection of point clouds as input for 2D fully convolutional neural networks to balance the accuracy-speed trade-off. This paper introduces a novel approach for 3D point cloud semantic segmentation that exploits multiple projections of the point cloud to mitigate the loss of information inherent in single projection methods. Our Multi-Projection Fusion (MPF) framework analyzes spherical and bird's-eye view projections using two separate highly-efficient 2D fully convolutional models then combines the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPolarNet
