MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds
Georg Krispel, David Schinagl, Christian Fruhwirth-Reisinger, Horst, Possegger, Horst Bischof

TL;DR
MAELi is a self-supervised pre-training framework for large-scale LiDAR point clouds that leverages inherent sparsity and masking strategies to improve 3D perception tasks without requiring annotations.
Contribution
It introduces a novel Masked AutoEncoder architecture tailored for LiDAR data, effectively utilizing sparsity and spherical projection for unsupervised scene understanding.
Findings
Pre-trained models improve 3D detection accuracy.
Unsupervised pre-training reduces annotation dependency.
Effective in semantic segmentation tasks.
Abstract
The sensing process of large-scale LiDAR point clouds inevitably causes large blind spots, i.e. regions not visible to the sensor. We demonstrate how these inherent sampling properties can be effectively utilized for self-supervised representation learning by designing a highly effective pre-training framework that considerably reduces the need for tedious 3D annotations to train state-of-the-art object detectors. Our Masked AutoEncoder for LiDAR point clouds (MAELi) intuitively leverages the sparsity of LiDAR point clouds in both the encoder and decoder during reconstruction. This results in more expressive and useful initialization, which can be directly applied to downstream perception tasks, such as 3D object detection or semantic segmentation for autonomous driving. In a novel reconstruction approach, MAELi distinguishes between empty and occluded space and employs a new masking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Advanced Optical Sensing Technologies
