Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps
Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal, Mian

TL;DR
This paper introduces a novel self-supervised learning approach using Transformers for outdoor 6DoF localization with LiDAR data, featuring a new dataset and demonstrating improved performance on multiple benchmarks.
Contribution
It presents the first use of multi-head attention Transformers for outdoor LiDAR localization and introduces the Perth-WA dataset for large-scale urban mapping.
Findings
Effective self-supervised learning for LiDAR-based localization
First application of Transformers in outdoor point cloud localization
Improved object classification performance on ModelNet40 and ScanNN
Abstract
Precise localization is critical for autonomous vehicles. We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data. We propose a pre-text task that reorganizes the slices of a LiDAR scan to leverage its axial properties. Our model, called Slice Transformer, employs multi-head attention while systematically processing the slices. To the best of our knowledge, this is the first instance of leveraging multi-head attention for outdoor point clouds. We additionally introduce the Perth-WA dataset, which provides a large-scale LiDAR map of Perth city in Western Australia, covering 4km area. Localization annotations are provided for Perth-WA. The proposed localization method is thoroughly evaluated on Perth-WA and Appollo-SouthBay datasets. We also establish the efficacy of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Absolute Position Encodings · Dropout · Label Smoothing · Byte Pair Encoding
