# On the Effectiveness of Low Frequency Perturbations

**Authors:** Yash Sharma, Gavin Weiguang Ding, Marcus Brubaker

arXiv: 1903.00073 · 2019-06-04

## TL;DR

This paper investigates the role of frequency components in adversarial perturbations, showing that low frequency perturbations are more effective and challenging the reliance on high frequency assumptions in current defenses.

## Contribution

It systematically analyzes the impact of frequency components on adversarial attacks, revealing that low frequency perturbations significantly compromise model robustness and questioning current defense strategies.

## Key findings

- Low frequency perturbations improve attack success in white-box and black-box settings.
- State-of-the-art defenses are as vulnerable to low frequency attacks as to undefended models.
- Low frequency perturbations are perceptible under typical distortion bounds.

## Abstract

Carefully crafted, often imperceptible, adversarial perturbations have been shown to cause state-of-the-art models to yield extremely inaccurate outputs, rendering them unsuitable for safety-critical application domains. In addition, recent work has shown that constraining the attack space to a low frequency regime is particularly effective. Yet, it remains unclear whether this is due to generally constraining the attack search space or specifically removing high frequency components from consideration. By systematically controlling the frequency components of the perturbation, evaluating against the top-placing defense submissions in the NeurIPS 2017 competition, we empirically show that performance improvements in both the white-box and black-box transfer settings are yielded only when low frequency components are preserved. In fact, the defended models based on adversarial training are roughly as vulnerable to low frequency perturbations as undefended models, suggesting that the purported robustness of state-of-the-art ImageNet defenses is reliant upon adversarial perturbations being high frequency in nature. We do find that under $\ell_\infty$ $\epsilon=16/255$, the competition distortion bound, low frequency perturbations are indeed perceptible. This questions the use of the $\ell_\infty$-norm, in particular, as a distortion metric, and, in turn, suggests that explicitly considering the frequency space is promising for learning robust models which better align with human perception.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.00073/full.md

## Figures

156 figures with captions in the complete paper: https://tomesphere.com/paper/1903.00073/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/1903.00073/full.md

---
Source: https://tomesphere.com/paper/1903.00073