Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation
Zihao Li, Pan Gao, Kang You, Chuan Yan, Manoranjan Paul

TL;DR
This paper introduces a novel point cloud analysis network that employs global attention-guided dual-domain feature learning to improve input embedding and aggregation efficiency, enhancing classification and segmentation tasks.
Contribution
It proposes the CPT module with global attention for better input embedding and the DKFF method for dual-domain feature aggregation, addressing key limitations in existing point cloud models.
Findings
Achieves superior performance on classification and segmentation tasks
Demonstrates the effectiveness of CPT and DKFF modules
Outperforms existing methods in multiple point cloud benchmarks
Abstract
Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Attention-guided Dual-domain Feature Learning network (GAD) to address the above-mentioned issues. We first devise the Contextual Position-enhanced Transformer (CPT) module, which is armed with an improved global attention mechanism, to produce a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the Dual-domain K-nearest neighbor Feature Fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Remote Sensing and LiDAR Applications · 3D Shape Modeling and Analysis
MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Linear Layer · Label Smoothing · Adam · Dropout · Multi-Head Attention · Dense Connections · Softmax
