Slice Transformer and Self-supervised Learning for 6DoF Localization in   3D Point Cloud Maps

Muhammad Ibrahim; Naveed Akhtar; Saeed Anwar; Michael Wise; Ajmal; Mian

arXiv:2301.08957·cs.CV·August 15, 2023

Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal, Mian

PDF

Open Access

TL;DR

This paper introduces a novel self-supervised learning approach using Transformers for outdoor 6DoF localization with LiDAR data, featuring a new dataset and demonstrating improved performance on multiple benchmarks.

Contribution

It presents the first use of multi-head attention Transformers for outdoor LiDAR localization and introduces the Perth-WA dataset for large-scale urban mapping.

Findings

01

Effective self-supervised learning for LiDAR-based localization

02

First application of Transformers in outdoor point cloud localization

03

Improved object classification performance on ModelNet40 and ScanNN

Abstract

Precise localization is critical for autonomous vehicles. We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data. We propose a pre-text task that reorganizes the slices of a $36 0^{\circ}$ LiDAR scan to leverage its axial properties. Our model, called Slice Transformer, employs multi-head attention while systematically processing the slices. To the best of our knowledge, this is the first instance of leveraging multi-head attention for outdoor point clouds. We additionally introduce the Perth-WA dataset, which provides a large-scale LiDAR map of Perth city in Western Australia, covering $\sim$ 4km $^{2}$ area. Localization annotations are provided for Perth-WA. The proposed localization method is thoroughly evaluated on Perth-WA and Appollo-SouthBay datasets. We also establish the efficacy of our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Absolute Position Encodings · Dropout · Label Smoothing · Byte Pair Encoding