Point Cloud Recognition with Position-to-Structure Attention Transformers
Zheng Ding, James Hou, Zhuowen Tu

TL;DR
This paper introduces PS-Former, a Transformer-based model for 3D point cloud recognition that automatically extracts features and enriches structural information without extensive feature engineering.
Contribution
The novel PS-Former architecture integrates a learnable condensation layer and a Position-to-Structure Attention mechanism for improved 3D point cloud analysis.
Findings
Competitive results on classification, part segmentation, and scene segmentation tasks.
Less reliance on heuristic feature engineering.
Effective structural information enrichment.
Abstract
In this paper, we present Position-to-Structure Attention Transformers (PS-Former), a Transformer-based algorithm for 3D point cloud recognition. PS-Former deals with the challenge in 3D point cloud representation where points are not positioned in a fixed grid structure and have limited feature description (only 3D coordinates () for scattered points). Existing Transformer-based architectures in this domain often require a pre-specified feature engineering step to extract point features. Here, we introduce two new aspects in PS-Former: 1) a learnable condensation layer that performs point downsampling and feature extraction; and 2) a Position-to-Structure Attention mechanism that recursively enriches the structural information with the position attention branch. Compared with the competing methods, while being generic with less heuristics feature designs, PS-Former…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications
