PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer   Architecture

Qiang Zheng; Chao Zhang; Jian Sun

arXiv:2408.05508·cs.CV·September 17, 2024

PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture

Qiang Zheng, Chao Zhang, Jian Sun

PDF

Open Access

TL;DR

PointMT introduces a hybrid MLP-Transformer architecture with linear local attention and adaptive channel weighting to efficiently analyze point clouds, achieving competitive performance with reduced computational costs.

Contribution

This work presents a novel hybrid MLP-Transformer model for point cloud analysis that reduces complexity and enhances convergence speed and feature representation.

Findings

01

Achieves state-of-the-art performance with lower computational cost

02

Introduces linear local attention mechanism for efficiency

03

Enhances convergence speed and feature representation

Abstract

In recent years, point cloud analysis methods based on the Transformer architecture have made significant progress, particularly in the context of multimedia applications such as 3D modeling, virtual reality, and autonomous systems. However, the high computational resource demands of the Transformer architecture hinder its scalability, real-time processing capabilities, and deployment on mobile devices and other platforms with limited computational resources. This limitation remains a significant obstacle to its practical application in scenarios requiring on-device intelligence and multimedia processing. To address this challenge, we propose an efficient point cloud analysis architecture, \textbf{Point} \textbf{M}LP-\textbf{T}ransformer (PointMT). This study tackles the quadratic complexity of the self-attention mechanism by introducing a linear complexity local attention mechanism for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Optical measurement and interference techniques · 3D Shape Modeling and Analysis

MethodsLinear Layer · Residual Connection · Multi-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Absolute Position Encodings · Dense Connections