Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation
Hanz Cuevas-Velasquez, Antonio Javier Gallego, Robert B. Fisher

TL;DR
This paper introduces a novel two-headed attention mechanism that combines geometric and latent features for improved point cloud segmentation and classification, achieving state-of-the-art results on multiple datasets.
Contribution
The paper proposes Ge-Latto, a two-headed attention layer that effectively integrates local and global features for point cloud analysis, enhancing robustness and invariance.
Findings
Achieves 69.2% IoU on S3DIS Area 5
Reaches 89.7% overall accuracy on S3DIS
Outperforms existing methods on benchmark datasets
Abstract
We present an innovative two-headed attention layer that combines geometric and latent features to segment a 3D scene into semantically meaningful subsets. Each head combines local and global information, using either the geometric or latent features, of a neighborhood of points and uses this information to learn better local relationships. This Geometric-Latent attention layer (Ge-Latto) is combined with a sub-sampling strategy to capture global features. Our method is invariant to permutation thanks to the use of shared-MLP layers, and it can also be used with point clouds with varying densities because the local attention layer does not depend on the neighbor order. Our proposal is simple yet robust, which allows it to achieve competitive results in the ShapeNetPart and ModelNet40 datasets, and the state-of-the-art when segmenting the complex dataset S3DIS, with 69.2% IoU on Area 5,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction
