Sparsity-based Defense against Adversarial Attacks on Linear Classifiers

Zhinus Marzi; Soorya Gopalakrishnan; Upamanyu Madhow; Ramtin Pedarsani

arXiv:1801.04695·stat.ML·June 20, 2018

Sparsity-based Defense against Adversarial Attacks on Linear Classifiers

Zhinus Marzi, Soorya Gopalakrishnan, Upamanyu Madhow, Ramtin Pedarsani

PDF

3 Repos

TL;DR

This paper proposes a sparsity-based defense mechanism for linear classifiers against adversarial attacks, demonstrating both theoretical rigor and empirical effectiveness on MNIST, addressing a key vulnerability in deep learning models.

Contribution

It introduces a novel sparsity-based approach to defend against adversarial attacks in linear classifiers, supported by theoretical analysis and experimental validation.

Findings

01

Sparsity can effectively mitigate $$-bounded adversarial perturbations.

02

The sparsifying front end improves robustness in linear classifiers.

03

First theoretical framework linking sparsity to adversarial defense.

Abstract

Deep neural networks represent the state of the art in machine learning in a growing number of fields, including vision, speech and natural language processing. However, recent work raises important questions about the robustness of such architectures, by showing that it is possible to induce classification errors through tiny, almost imperceptible, perturbations. Vulnerability to such "adversarial attacks", or "adversarial examples", has been conjectured to be due to the excessive linearity of deep networks. In this paper, we study this phenomenon in the setting of a linear classifier, and show that it is possible to exploit sparsity in natural data to combat $ℓ_{\infty}$ -bounded adversarial perturbations. Specifically, we demonstrate the efficacy of a sparsifying front end via an ensemble averaged analysis, and experimental results for the MNIST handwritten digit database. To the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.