Gaussian MRF Covariance Modeling for Efficient Black-Box Adversarial Attacks
Anit Kumar Sahu, Satya Narayan Shukla, J. Zico Kolter

TL;DR
This paper introduces a Gaussian Markov random field model to capture gradient correlations in black-box adversarial attacks, enabling faster, more efficient attacks with fewer queries and higher success rates.
Contribution
It proposes a novel GMRF-based gradient modeling approach for black-box adversarial attacks, improving efficiency and success compared to prior zeroth order methods.
Findings
Fewer queries needed for successful attacks.
Higher attack success rates than existing methods.
Efficient covariance representation using FFT and low-rank updates.
Abstract
We study the problem of generating adversarial examples in a black-box setting, where we only have access to a zeroth order oracle, providing us with loss function evaluations. Although this setting has been investigated in previous work, most past approaches using zeroth order optimization implicitly assume that the gradients of the loss function with respect to the input images are \emph{unstructured}. In this work, we show that in fact substantial correlations exist within these gradients, and we propose to capture these correlations via a Gaussian Markov random field (GMRF). Given the intractability of the explicit covariance structure of the MRF, we show that the covariance structure can be efficiently represented using the Fast Fourier Transform (FFT), along with low-rank updates to perform exact posterior estimation under this model. We use this modeling technique to find fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
