SP$^2$T: Sparse Proxy Attention for Dual-stream Point Transformer
Jiaxu Wan, Hong Zhang, Ziqi He, Yangyan Deng, Qishu Wang, Ding Yuan, Yifan Yang

TL;DR
SP$^2$T introduces a novel dual-stream point transformer with sparse proxy attention, improving 3D point cloud understanding by balancing local-global features efficiently and achieving state-of-the-art results.
Contribution
The paper proposes Sparse Proxy Point Transformer (SP$^2$T), featuring spatial proxy sampling, sparse proxy attention, and a dual-stream architecture for enhanced 3D understanding.
Findings
Achieves state-of-the-art results on indoor and outdoor benchmarks.
Improves mIoU by +3.8% on S3DIS and +22.9% on Sem.KITTI.
Demonstrates efficient proxy interaction with table-based relative bias.
Abstract
Point transformers have demonstrated remarkable progress in 3D understanding through expanded receptive fields (RF), but further expanding the RF leads to dilution in group attention and decreases detailed feature extraction capability. Proxy, which serves as abstract representations for simplifying feature maps, enables global RF. However, existing proxy-based approaches face critical limitations: Global proxies incur quadratic complexity for large-scale point clouds and suffer positional ambiguity, while local proxy alternatives struggle with 1) Unreliable sampling from the geometrically diverse point cloud, 2) Inefficient proxy interaction computation, and 3) Imbalanced local-global information fusion; To address these challenges, we propose Sparse Proxy Point Transformer (SPT) -- a local proxy-based dual-stream point transformer with three key innovations: First, for reliable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSensor Technology and Measurement Systems · Advanced Optical Sensing Technologies · Optical Systems and Laser Technology
MethodsSoftmax · Attention Is All You Need
