Optimizing Information Loss Towards Robust Neural Networks
Philip Sperl, Konstantin B\"ottinger

TL;DR
This paper introduces entropic retraining, a novel method inspired by information theory, to enhance neural network robustness against adversarial attacks without generating adversarial examples, reducing training complexity.
Contribution
The paper proposes entropic retraining, a new approach that improves neural network robustness efficiently by avoiding adversarial example generation, based on an information-theoretic analysis.
Findings
Entropic retraining significantly increases neural network security.
It achieves robustness comparable to adversarial training without generating adversarial examples.
Effective across various architectures and datasets.
Abstract
Neural Networks (NNs) are vulnerable to adversarial examples. Such inputs differ only slightly from their benign counterparts yet provoke misclassifications of the attacked NNs. The required perturbations to craft the examples are often negligible and even human imperceptible. To protect deep learning-based systems from such attacks, several countermeasures have been proposed with adversarial training still being considered the most effective. Here, NNs are iteratively retrained using adversarial examples forming a computational expensive and time consuming process often leading to a performance decrease. To overcome the downsides of adversarial training while still providing a high level of security, we present a new training approach we call \textit{entropic retraining}. Based on an information-theoretic-inspired analysis, entropic retraining mimics the effects of adversarial training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
