Adversarial Information Bottleneck
Penglong Zhai, Shihua Zhang

TL;DR
This paper introduces an adversarial information bottleneck (AIB) method that enhances robustness against adversarial attacks by optimizing a Min-Max problem, improving invariant representations and revealing optimal IB hyperparameters.
Contribution
The paper proposes a novel AIB approach that does not assume distributional forms and effectively improves adversarial robustness through a Min-Max optimization framework.
Findings
AIB outperforms existing IB methods in resisting adversarial perturbations.
IB models at the knee point of the IB curve balance compression and robustness.
Analysis shows hyperparameters at the IB curve's knee point yield optimal trade-offs.
Abstract
The information bottleneck (IB) principle has been adopted to explain deep learning in terms of information compression and prediction, which are balanced by a trade-off hyperparameter. How to optimize the IB principle for better robustness and figure out the effects of compression through the trade-off hyperparameter are two challenging problems. Previous methods attempted to optimize the IB principle by introducing random noise into learning the representation and achieved state-of-the-art performance in the nuisance information compression and semantic information extraction. However, their performance on resisting adversarial perturbations is far less impressive. To this end, we propose an adversarial information bottleneck (AIB) method without any explicit assumptions about the underlying distribution of the representations, which can be optimized effectively by solving a Min-Max…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
