InfoAT: Improving Adversarial Training Using the Information Bottleneck   Principle

Mengting Xu; Tao Zhang; Zhongnian Li; Daoqiang Zhang

arXiv:2206.12292·cs.LG·June 27, 2022

InfoAT: Improving Adversarial Training Using the Information Bottleneck Principle

Mengting Xu, Tao Zhang, Zhongnian Li, Daoqiang Zhang

PDF

TL;DR

This paper introduces InfoAT, a novel adversarial training method inspired by the information bottleneck principle, which focuses on hard examples with high mutual information to enhance model robustness.

Contribution

The paper proposes a new adversarial training approach that leverages the information bottleneck principle to identify and exploit hard examples for improved robustness.

Findings

01

InfoAT outperforms state-of-the-art methods on multiple datasets.

02

Hard examples with high mutual information are crucial for robustness.

03

Experimental results validate the effectiveness of InfoAT.

Abstract

Adversarial training (AT) has shown excellent high performance in defending against adversarial examples. Recent studies demonstrate that examples are not equally important to the final robustness of models during AT, that is, the so-called hard examples that can be attacked easily exhibit more influence than robust examples on the final robustness. Therefore, guaranteeing the robustness of hard examples is crucial for improving the final robustness of the model. However, defining effective heuristics to search for hard examples is still difficult. In this article, inspired by the information bottleneck (IB) principle, we uncover that an example with high mutual information of the input and its associated latent representation is more likely to be attacked. Based on this observation, we propose a novel and effective adversarial training method (InfoAT). InfoAT is encouraged to find…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.