CpT: Convolutional Point Transformer for 3D Point Cloud Processing
Chaitanya Kaul, Joshua Mitton, Hang Dai, Roderick Murray-Smith

TL;DR
CpT introduces a convolutional point transformer architecture that effectively processes unstructured 3D point cloud data by creating robust, permutation-invariant embeddings through local neighborhood attention, outperforming existing methods.
Contribution
The paper proposes a novel convolutional point transformer with a dynamic graph-based attention mechanism for improved 3D point cloud processing.
Findings
Achieves state-of-the-art results on ModelNet40, ShapeNet Part, and S3DIS datasets.
Demonstrates robustness to point permutations and local neighborhood variations.
Serves as an effective backbone for various 3D point cloud tasks.
Abstract
We present CpT: Convolutional point Transformer - a novel deep learning architecture for dealing with the unstructured nature of 3D point cloud data. CpT is an improvement over existing attention-based Convolutions Neural Networks as well as previous 3D point cloud processing transformers. It achieves this feat due to its effectiveness in creating a novel and robust attention-based point set embedding through a convolutional projection layer crafted for processing dynamically local point set neighbourhoods. The resultant point set embedding is robust to the permutations of the input points. Our novel CpT block builds over local neighbourhoods of points obtained via a dynamic graph computation at each layer of the networks' structure. It is fully differentiable and can be stacked just like convolutional layers to learn global properties of the points. We evaluate our model on standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Adam · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Residual Connection · Dense Connections
