SeqOT: A Spatial-Temporal Transformer Network for Place Recognition Using Sequential LiDAR Data
Junyi Ma, Xieyuanli Chen, Jingyi Xu, Guangming Xiong

TL;DR
SeqOT introduces a transformer-based approach for place recognition using sequential LiDAR data, achieving superior accuracy and real-time performance across diverse environments.
Contribution
The paper proposes SeqOT, a novel transformer network that effectively exploits spatial-temporal information from sequential LiDAR scans for place recognition.
Findings
Outperforms state-of-the-art LiDAR-based methods
Generalizes well across different environments
Operates faster than the sensor frame rate
Abstract
Place recognition is an important component for autonomous vehicles to achieve loop closing or global localization. In this paper, we tackle the problem of place recognition based on sequential 3D LiDAR scans obtained by an onboard LiDAR sensor. We propose a transformer-based network named SeqOT to exploit the temporal and spatial information provided by sequential range images generated from the LiDAR data. It uses multi-scale transformers to generate a global descriptor for each sequence of LiDAR range images in an end-to-end fashion. During online operation, our SeqOT finds similar places by matching such descriptors between the current query sequence and those stored in the map. We evaluate our approach on four datasets collected with different types of LiDAR sensors in different environments. The experimental results show that our method outperforms the state-of-the-art LiDAR-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Video Surveillance and Tracking Methods
