HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud

Zhixing Hou; Yan Yan; Chengzhong Xu; Hui Kong

arXiv:2204.05481·cs.CV·April 13, 2022

HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud

Zhixing Hou, Yan Yan, Chengzhong Xu, Hui Kong

PDF

Open Access

TL;DR

This paper introduces HiTPR, a hierarchical transformer-based method for place recognition in point clouds, which effectively captures local and global features to improve SLAM performance.

Contribution

The paper proposes a novel hierarchical transformer architecture for point cloud feature extraction, combining local and global dependencies for enhanced place recognition.

Findings

01

Achieves 93.71% top 1% recall on Oxford RobotCar dataset.

02

Outperforms existing methods in average recall rate.

03

Demonstrates effectiveness of hierarchical transformers in 3D point cloud analysis.

Abstract

Place recognition or loop closure detection is one of the core components in a full SLAM system. In this paper, aiming at strengthening the relevancy of local neighboring points and the contextual dependency among global points simultaneously, we investigate the exploitation of transformer-based network for feature extraction, and propose a Hierarchical Transformer for Place Recognition (HiTPR). The HiTPR consists of four major parts: point cell generation, short-range transformer (SRT), long-range transformer (LRT) and global descriptor aggregation. Specifically, the point cloud is initially divided into a sequence of small cells by downsampling and nearest neighbors searching. In the SRT, we extract the local feature for each point cell. While in the LRT, we build the global dependency among all of the point cells in the whole point cloud. Experiments on several standard benchmarks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Layer Normalization · Residual Connection · Softmax