Local Reweighting for Adversarial Training
Ruize Gao, Feng Liu, Kaiwen Zhou, Gang Niu, Bo Han, James Cheng

TL;DR
This paper introduces locally reweighted adversarial training (LRAT), which adaptively adjusts instance weights based on attack-specific safeness, improving robustness against unseen attacks compared to previous reweighting methods.
Contribution
The paper proposes LRAT, a novel attack-dependent reweighting method that pairs instances with adversarial variants for local reweighting, enhancing robustness across different attack types.
Findings
LRAT outperforms IRAT and standard adversarial training on unseen attacks.
Local reweighting improves robustness without global reweighting.
LRAT effectively defends against diverse adversarial attacks.
Abstract
Instances-reweighted adversarial training (IRAT) can significantly boost the robustness of trained models, where data being less/more vulnerable to the given attack are assigned smaller/larger weights during training. However, when tested on attacks different from the given attack simulated in training, the robustness may drop significantly (e.g., even worse than no reweighting). In this paper, we study this problem and propose our solution--locally reweighted adversarial training (LRAT). The rationale behind IRAT is that we do not need to pay much attention to an instance that is already safe under the attack. We argue that the safeness should be attack-dependent, so that for the same instance, its weight can change given different attacks based on the same model. Thus, if the attack simulated in training is mis-specified, the weights of IRAT are misleading. To this end, LRAT pairs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
