Robust Classification Under $\ell_0$ Attack for the Gaussian Mixture Model
Payam Delgosha, Hamed Hassani, Ramtin Pedarsani

TL;DR
This paper studies robust classification under sparse adversarial attacks in Gaussian mixture models, proposing a new algorithm and characterizing optimal error bounds and phase transitions as data dimension grows.
Contribution
It introduces the FilTrun algorithm for $ ext{ extl{}0}$-bounded attacks and provides theoretical bounds and asymptotic optimality results for Gaussian mixture models.
Findings
Derived upper bounds on robust classification error.
Established lower bounds and characterized asymptotic optimality.
Identified phase transitions in adversary's budget impact.
Abstract
It is well-known that machine learning models are vulnerable to small but cleverly-designed adversarial perturbations that can cause misclassification. While there has been major progress in designing attacks and defenses for various adversarial settings, many fundamental and theoretical problems are yet to be resolved. In this paper, we consider classification in the presence of -bounded adversarial perturbations, a.k.a. sparse attacks. This setting is significantly different from other -adversarial settings, with , as the -ball is non-convex and highly non-smooth. Under the assumption that data is distributed according to the Gaussian mixture model, our goal is to characterize the optimal robust classifier and the corresponding robust classification error as well as a variety of trade-offs between robustness, accuracy, and the adversary's budget. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
