Random Projections for Improved Adversarial Robustness

Ginevra Carbone; Guido Sanguinetti; Luca Bortolussi

arXiv:2102.09230·cs.LG·April 28, 2021

Random Projections for Improved Adversarial Robustness

Ginevra Carbone, Guido Sanguinetti, Luca Bortolussi

PDF

TL;DR

This paper introduces two novel training methods using random projections to enhance neural network robustness against adversarial attacks, leveraging geometric properties and dimensionality reduction.

Contribution

The paper presents two new techniques, RP-Ensemble and RP-Regularizer, that improve adversarial robustness independently of attack type using random projections.

Findings

01

RP-Ensemble improves robustness through ensemble learning.

02

RP-Regularizer enhances robustness via regularization.

03

Both methods are attack-agnostic.

Abstract

We propose two training techniques for improving the robustness of Neural Networks to adversarial attacks, i.e. manipulations of the inputs that are maliciously crafted to fool networks into incorrect predictions. Both methods are independent of the chosen attack and leverage random projections of the original inputs, with the purpose of exploiting both dimensionality reduction and some characteristic geometrical properties of adversarial perturbations. The first technique is called RP-Ensemble and consists of an ensemble of networks trained on multiple projected versions of the original inputs. The second one, named RP-Regularizer, adds instead a regularization term to the training objective.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.