Adaptive-Gravity: A Defense Against Adversarial Samples
Ali Mirzaeian, Zhi Tian, Sai Manoj P D, Banafsheh S. Latibari, Ioannis, Savidis, Houman Homayoun, Avesta Sasan

TL;DR
Adaptive-Gravity introduces a novel training method that enhances neural network robustness against adversarial attacks by increasing class separation through an anti-gravity force mechanism during training.
Contribution
It proposes a new anti-gravity based training approach that improves adversarial robustness and training accuracy simultaneously.
Findings
Significantly reduces fooling rates against multiple attack models.
Improves classification accuracy on benchmark datasets.
Effective across different neural network architectures.
Abstract
This paper presents a novel model training solution, denoted as Adaptive-Gravity, for enhancing the robustness of deep neural network classifiers against adversarial examples. We conceptualize the model parameters/features associated with each class as a mass characterized by its centroid location and the spread (standard deviation of the distance) of features around the centroid. We use the centroid associated with each cluster to derive an anti-gravity force that pushes the centroids of different classes away from one another during network training. Then we customized an objective function that aims to concentrate each class's features toward their corresponding new centroid, which has been obtained by anti-gravity force. This methodology results in a larger separation between different masses and reduces the spread of features around each centroid. As a result, the samples are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsGravity
