Efficient Point Transformer with Dynamic Token Aggregating for LiDAR Point Cloud Processing
Dening Lu, Jun Zhou, Kyle (Yilin) Gao, Linlin Xu, Jonathan Li

TL;DR
This paper introduces DTA-Former, an efficient 3D Transformer model for LiDAR point cloud processing that reduces computational costs through dynamic token aggregation and adaptive sparsification, achieving high classification accuracy.
Contribution
The paper presents a novel DTA-Former architecture with learnable token sparsification, dynamic token aggregation, and iterative token reconstruction for improved efficiency and performance in 3D point cloud analysis.
Findings
Achieves state-of-the-art classification accuracy on LiDAR datasets.
Reduces computational complexity compared to existing 3D Transformer methods.
Demonstrates effective dense prediction with the proposed ITR and W-net architecture.
Abstract
Recently, LiDAR point cloud processing and analysis have made great progress due to the development of 3D Transformers. However, existing 3D Transformer methods usually are computationally expensive and inefficient due to their huge and redundant attention maps. They also tend to be slow due to requiring time-consuming point cloud sampling and grouping processes. To address these issues, we propose an efficient point TransFormer with Dynamic Token Aggregating (DTA-Former) for point cloud representation and processing. Firstly, we propose an efficient Learnable Token Sparsification (LTS) block, which considers both local and global semantic information for the adaptive selection of key tokens. Secondly, to achieve the feature aggregation for sparsified tokens, we present the first Dynamic Token Aggregating (DTA) block in the 3D Transformer paradigm, providing our model with strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Sensing Technologies · 3D Surveying and Cultural Heritage · Optical measurement and interference techniques
MethodsConcatenated Skip Connection · Max Pooling · Convolution · Byte Pair Encoding · Linear Layer · Absolute Position Encodings · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Softmax · Attention Is All You Need
