Point Cloud Classification Using Content-based Transformer via   Clustering in Feature Space

Yahui Liu; Bin Tian; Yisheng Lv; Lingxi Li; Feiyue Wang

arXiv:2303.04599·cs.CV·July 21, 2023·6 cites

Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space

Yahui Liu, Bin Tian, Yisheng Lv, Lingxi Li, Feiyue Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces PointConT, a content-based Transformer for 3D point cloud classification that clusters points in feature space to efficiently model long-range dependencies, achieving high accuracy with reduced computation.

Contribution

It proposes a novel content-based Transformer architecture that clusters points in feature space and uses an Inception feature aggregator for improved point cloud classification.

Findings

01

Achieves 90.3% Top-1 accuracy on ScanObjectNN

02

Effectively models long-range dependencies in point clouds

03

Reduces computational complexity compared to local attention methods

Abstract

Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention, but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space (content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an Inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yahuiliu99/pointcont
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage

MethodsMulti-Head Attention · Attention Is All You Need · fail · Linear Layer · Dropout · Layer Normalization · Residual Connection · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Softmax