Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space
Yahui Liu, Bin Tian, Yisheng Lv, Lingxi Li, Feiyue Wang

TL;DR
This paper introduces PointConT, a content-based Transformer for 3D point cloud classification that clusters points in feature space to efficiently model long-range dependencies, achieving high accuracy with reduced computation.
Contribution
It proposes a novel content-based Transformer architecture that clusters points in feature space and uses an Inception feature aggregator for improved point cloud classification.
Findings
Achieves 90.3% Top-1 accuracy on ScanObjectNN
Effectively models long-range dependencies in point clouds
Reduces computational complexity compared to local attention methods
Abstract
Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention, but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space (content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an Inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage
MethodsMulti-Head Attention · Attention Is All You Need · fail · Linear Layer · Dropout · Layer Normalization · Residual Connection · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Softmax
