Omnipotent Adversarial Training in the Wild

Guanlin Li; Kangjie Chen; Yuan Xu; Han Qiu; Tianwei Zhang

arXiv:2307.08596·cs.LG·December 5, 2023

Omnipotent Adversarial Training in the Wild

Guanlin Li, Kangjie Chen, Yuan Xu, Han Qiu, Tianwei Zhang

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces Omnipotent Adversarial Training (OAT), a novel approach that improves both clean accuracy and adversarial robustness in imbalanced and noisy datasets by using an oracle and logits adjustment.

Contribution

The paper proposes a new adversarial training strategy with an oracle and logits adjustment to handle real-world data imperfections, enhancing robustness and accuracy.

Findings

01

OAT outperforms baselines by over 20% in clean accuracy.

02

OAT achieves more than 10% improvement in robust accuracy.

03

Effective in complex data imbalance and noise scenarios.

Abstract

Adversarial training is an important topic in robust deep learning, but the community lacks attention to its practical usage. In this paper, we aim to resolve a real-world challenge, i.e., training a model on an imbalanced and noisy dataset to achieve high clean accuracy and adversarial robustness, with our proposed Omnipotent Adversarial Training (OAT) strategy. OAT consists of two innovative methodologies to address the imperfection in the training set. We first introduce an oracle into the adversarial training process to help the model learn a correct data-label conditional distribution. This carefully-designed oracle can provide correct label annotations for adversarial training. We further propose logits adjustment adversarial training to overcome the data imbalance issue, which can help the model learn a Bayes-optimal distribution. Our comprehensive evaluation results show that…

Peer Reviews

Decision·ICLR 2024 Conference Withdrawn Submission

Reviewer 01Rating 3· reject, not good enoughConfidence 4

Strengths

The problems of label noise, sample imbalance, and adversarial sample are simultaneously considered in training a model.

Weaknesses

Sample labeling noise, sample imbalance, and adversarial sample are problems that have been studied separately, and a number of related methods have been proposed. The proposed method in this draft is a simple combination of existing techniques. The sample label correction, data re-sampling, contrastive learning, and adversarial training included in the proposed method are conventional methods. Overall, not much technological innovation has been seen in this manuscript.

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

Strength: 1. This paper explores a new setting for adversarial training where training data could be contaminated with noise and imbalanced.

Weaknesses

Weakness: 1. The proposed method does not unify adversarial training with learning from noisy and long-tailed data. These two parts are seemly optimized separately. 2. The techniques introduced for learning from noisy and long-tailed data are not new. Re-sampling is a well-known method for learning from imbalanced data. Label refurbishment is also widely adopted in self-training. Contrastive training is also quite matured. The technical contribution is thus very limited. 3. The evaluation of

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

+ This paper is the first trial to achieve better clean and robust accuracy for adversarial training under the noisy and imbalanced conditions. + Multiple techniques are efficiently combined for better adversarial training and the key components in this work are all reasonable. + The effectiveness of the proposed method is clearly shown with CIFAR-10 and 100 by significantly outperforming the baseline methods in terms of both clean accuracy and robust accuracy.

Weaknesses

1. I am afraid the technical novelty is relatively weak. - Data-resampling (over-sampling, sub-sampling, SMOTE, etc) can be found in many papers. - The label refurbishment and dataset split using k-NN seems essentially the same as in the following paper. If not, the authors should clarify the novelty more clearly. [a] Dara Bahri, Heinrich Jiang, and Maya Gupta. Deep k-nn for noisy labels, Proceedings of Machine Learning Research, 2020. [b] Chen Feng, Georgios Tzimiropoulos, Ioannis Patras. SSR:

Code & Models

Repositories

guanlinlee/oat
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning