Learning perturbation sets for robust machine learning
Eric Wong, J. Zico Kolter

TL;DR
This paper introduces a method to learn data-driven perturbation sets using a conditional variational autoencoder, enabling more realistic robustness training against diverse real-world image variations.
Contribution
It proposes a framework to learn perturbation sets from data with theoretical guarantees, improving robustness and generalization in deep learning models.
Findings
Models generate diverse, meaningful perturbations beyond training data
Learned perturbation sets improve robustness to adversarial corruptions
Enhanced model generalization on non-adversarial data
Abstract
Although much progress has been made towards robust deep learning, a significant gap in robustness remains between real-world perturbations and more narrowly defined sets typically studied in adversarial defenses. In this paper, we aim to bridge this gap by learning perturbation sets from data, in order to characterize real-world effects for robust training and evaluation. Specifically, we use a conditional generator that defines the perturbation set over a constrained region of the latent space. We formulate desirable properties that measure the quality of a learned perturbation set, and theoretically prove that a conditional variational autoencoder naturally satisfies these criteria. Using this framework, our approach can generate a variety of perturbations at different complexities and scales, ranging from baseline spatial transformations, through common image corruptions, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research
MethodsSolana Customer Service Number +1-833-534-1729
