Target-Guided Adversarial Point Cloud Transformer Towards Recognition   Against Real-world Corruptions

Jie Wang; Tingfa Xu; Lihe Ding; Jianan Li

arXiv:2411.00462·cs.CV·November 4, 2024

Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions

Jie Wang, Tingfa Xu, Lihe Ding, Jianan Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces APCT, a transformer-based model for 3D point cloud recognition that enhances robustness against real-world corruptions by using adversarial feature erasing guided by pattern analysis during training.

Contribution

The paper proposes a novel architecture with adversarial feature erasing mechanisms, including an Adversarial Significance Identifier and a Target-guided Promptor, to improve robustness in 3D recognition tasks.

Findings

01

Achieves state-of-the-art results on multiple corruption benchmarks.

02

Effectively identifies and integrates diverse object patterns during training.

03

Enhances global structure capture in point cloud recognition models.

Abstract

Achieving robust 3D perception in the face of corrupted data presents an challenging hurdle within 3D vision research. Contemporary transformer-based point cloud recognition models, albeit advanced, tend to overfit to specific patterns, consequently undermining their robustness against corruption. In this work, we introduce the Target-Guided Adversarial Point Cloud Transformer, termed APCT, a novel architecture designed to augment global structure capture through an adversarial feature erasing mechanism predicated on patterns discerned at each step during training. Specifically, APCT integrates an Adversarial Significance Identifier and a Target-guided Promptor. The Adversarial Significance Identifier, is tasked with discerning token significance by integrating global contextual analysis, utilizing a structural salience index algorithm alongside an auxiliary supervisory mechanism. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

roywangj/apct
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote Sensing and LiDAR Applications · Industrial Vision Systems and Defect Detection · 3D Surveying and Cultural Heritage

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings