Phase-shifted Adversarial Training
Yeachan Kim, Seongyeon Kim, Ihyeok Seo, Bonggun Shin

TL;DR
This paper introduces Phase-shifted Adversarial Training (PhaseAT), a novel method that shifts frequency learning to improve neural network robustness by better capturing high-frequency information.
Contribution
The paper proposes PhaseAT, which shifts the frequency learning range to enhance high-frequency content learning in adversarial training, improving robustness.
Findings
PhaseAT significantly improves convergence for high-frequency information.
PhaseAT enhances adversarial robustness with smoothed predictions.
Experiments on CIFAR-10 and ImageNet validate effectiveness.
Abstract
Adversarial training has been considered an imperative component for safely deploying neural network-based applications to the real world. To achieve stronger robustness, existing methods primarily focus on how to generate strong attacks by increasing the number of update steps, regularizing the models with the smoothed loss function, and injecting the randomness into the attack. Instead, we analyze the behavior of adversarial training through the lens of response frequency. We empirically discover that adversarial training causes neural networks to have low convergence to high-frequency information, resulting in highly oscillated predictions near each data. To learn high-frequency contents efficiently and effectively, we first prove that a universal phenomenon of frequency principle, i.e., \textit{lower frequencies are learned first}, still holds in adversarial training. Based on that,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Nuclear Physics and Applications · Anomaly Detection Techniques and Applications
