HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud
Zhixing Hou, Yan Yan, Chengzhong Xu, Hui Kong

TL;DR
This paper introduces HiTPR, a hierarchical transformer-based method for place recognition in point clouds, which effectively captures local and global features to improve SLAM performance.
Contribution
The paper proposes a novel hierarchical transformer architecture for point cloud feature extraction, combining local and global dependencies for enhanced place recognition.
Findings
Achieves 93.71% top 1% recall on Oxford RobotCar dataset.
Outperforms existing methods in average recall rate.
Demonstrates effectiveness of hierarchical transformers in 3D point cloud analysis.
Abstract
Place recognition or loop closure detection is one of the core components in a full SLAM system. In this paper, aiming at strengthening the relevancy of local neighboring points and the contextual dependency among global points simultaneously, we investigate the exploitation of transformer-based network for feature extraction, and propose a Hierarchical Transformer for Place Recognition (HiTPR). The HiTPR consists of four major parts: point cell generation, short-range transformer (SRT), long-range transformer (LRT) and global descriptor aggregation. Specifically, the point cloud is initially divided into a sequence of small cells by downsampling and nearest neighbors searching. In the SRT, we extract the local feature for each point cell. While in the LRT, we build the global dependency among all of the point cells in the whole point cloud. Experiments on several standard benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Layer Normalization · Residual Connection · Softmax
