Multi-scale Geometry-aware Transformer for 3D Point Cloud Classification
Xian Wei, Muyu Wang, Shing-Ho Jonathan Lin, Zhengyu Li, Jian Yang,, Arafat Al-Jawari, Xuan Tang

TL;DR
This paper introduces the Multi-scale Geometry-aware Transformer (MGT), a novel self-attention based module that effectively captures multi-scale non-Euclidean geometric structures in 3D point clouds, enhancing classification performance.
Contribution
The paper proposes MGT, a new self-attention module that processes multi-scale local and global geometric information in point clouds, addressing limitations of existing methods.
Findings
MGT significantly improves point cloud classification accuracy.
MGT effectively captures multi-scale geometric features.
The method achieves competitive results on benchmark datasets.
Abstract
Self-attention modules have demonstrated remarkable capabilities in capturing long-range relationships and improving the performance of point cloud tasks. However, point cloud objects are typically characterized by complex, disordered, and non-Euclidean spatial structures with multiple scales, and their behavior is often dynamic and unpredictable. The current self-attention modules mostly rely on dot product multiplication and dimension alignment among query-key-value features, which cannot adequately capture the multi-scale non-Euclidean structures of point cloud objects. To address these problems, this paper proposes a self-attention plug-in module with its variants, Multi-scale Geometry-aware Transformer (MGT). MGT processes point cloud data with multi-scale local and global geometric information in the following three aspects. At first, the MGT divides point cloud data into patches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Optical measurement and interference techniques
MethodsMulti-Head Attention · Attention Is All You Need · Dropout · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection
