Towards the Memorization Effect of Neural Networks in Adversarial   Training

Han Xu; Xiaorui Liu; Wentao Wang; Wenbiao Ding; Zhongqin Wu; Zitao; Liu; Anil Jain; Jiliang Tang

arXiv:2106.04794·cs.LG·June 10, 2021·1 cites

Towards the Memorization Effect of Neural Networks in Adversarial Training

Han Xu, Xiaorui Liu, Wentao Wang, Wenbiao Ding, Zhongqin Wu, Zitao, Liu, Anil Jain, Jiliang Tang

PDF

Open Access

TL;DR

This paper investigates the role of memorization in adversarial training of neural networks, revealing that memorizing atypical samples has limited benefits for robustness and can harm performance on typical samples, leading to a new training method.

Contribution

It introduces Benign Adversarial Training (BAT), a novel approach to improve the trade-off between accuracy and robustness by avoiding harmful memorization during adversarial training.

Findings

01

Memorizing atypical samples improves accuracy on atypical data but not robustness.

02

Memorizing certain atypical samples can reduce performance on typical samples.

03

BAT achieves better accuracy-robustness trade-offs on CIFAR100 and Tiny ImageNet.

Abstract

Recent studies suggest that ``memorization'' is one important factor for overparameterized deep neural networks (DNNs) to achieve optimal performance. Specifically, the perfectly fitted DNNs can memorize the labels of many atypical samples, generalize their memorization to correctly classify test atypical samples and enjoy better test performance. While, DNNs which are optimized via adversarial training algorithms can also achieve perfect training performance by memorizing the labels of atypical samples, as well as the adversarially perturbed atypical samples. However, adversarially trained models always suffer from poor generalization, with both relatively low clean accuracy and robustness on the test set. In this work, we study the effect of memorization in adversarial trained DNNs and disclose two important findings: (a) Memorizing atypical samples is only effective to improve DNN's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition