PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture
Qiang Zheng, Chao Zhang, Jian Sun

TL;DR
PointMT introduces a hybrid MLP-Transformer architecture with linear local attention and adaptive channel weighting to efficiently analyze point clouds, achieving competitive performance with reduced computational costs.
Contribution
This work presents a novel hybrid MLP-Transformer model for point cloud analysis that reduces complexity and enhances convergence speed and feature representation.
Findings
Achieves state-of-the-art performance with lower computational cost
Introduces linear local attention mechanism for efficiency
Enhances convergence speed and feature representation
Abstract
In recent years, point cloud analysis methods based on the Transformer architecture have made significant progress, particularly in the context of multimedia applications such as 3D modeling, virtual reality, and autonomous systems. However, the high computational resource demands of the Transformer architecture hinder its scalability, real-time processing capabilities, and deployment on mobile devices and other platforms with limited computational resources. This limitation remains a significant obstacle to its practical application in scenarios requiring on-device intelligence and multimedia processing. To address this challenge, we propose an efficient point cloud analysis architecture, \textbf{Point} \textbf{M}LP-\textbf{T}ransformer (PointMT). This study tackles the quadratic complexity of the self-attention mechanism by introducing a linear complexity local attention mechanism for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Optical measurement and interference techniques · 3D Shape Modeling and Analysis
MethodsLinear Layer · Residual Connection · Multi-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Absolute Position Encodings · Dense Connections
