Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions
Jie Wang, Tingfa Xu, Lihe Ding, Jianan Li

TL;DR
This paper introduces APCT, a transformer-based model for 3D point cloud recognition that enhances robustness against real-world corruptions by using adversarial feature erasing guided by pattern analysis during training.
Contribution
The paper proposes a novel architecture with adversarial feature erasing mechanisms, including an Adversarial Significance Identifier and a Target-guided Promptor, to improve robustness in 3D recognition tasks.
Findings
Achieves state-of-the-art results on multiple corruption benchmarks.
Effectively identifies and integrates diverse object patterns during training.
Enhances global structure capture in point cloud recognition models.
Abstract
Achieving robust 3D perception in the face of corrupted data presents an challenging hurdle within 3D vision research. Contemporary transformer-based point cloud recognition models, albeit advanced, tend to overfit to specific patterns, consequently undermining their robustness against corruption. In this work, we introduce the Target-Guided Adversarial Point Cloud Transformer, termed APCT, a novel architecture designed to augment global structure capture through an adversarial feature erasing mechanism predicated on patterns discerned at each step during training. Specifically, APCT integrates an Adversarial Significance Identifier and a Target-guided Promptor. The Adversarial Significance Identifier, is tasked with discerning token significance by integrating global contextual analysis, utilizing a structural salience index algorithm alongside an auxiliary supervisory mechanism. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · Industrial Vision Systems and Defect Detection · 3D Surveying and Cultural Heritage
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings
