Understanding and Combating Robust Overfitting via Input Loss Landscape   Analysis and Regularization

Lin Li; Michael Spratling

arXiv:2212.04985·cs.LG·December 12, 2022

Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization

Lin Li, Michael Spratling

PDF

1 Repo

TL;DR

This paper analyzes the causes of robust overfitting in adversarial training by examining the input loss landscape and proposes a new regularizer to improve robustness and generalization.

Contribution

It introduces a novel loss landscape regularizer that mitigates robust overfitting and enhances adversarial robustness during training.

Findings

01

Regularization of loss gradients reduces overfitting.

02

Adversarial training's gradient regularization weakens with increased landscape curvature.

03

Proposed method outperforms previous approaches in robustness and efficiency.

Abstract

Adversarial training is widely used to improve the robustness of deep neural networks to adversarial attack. However, adversarial training is prone to overfitting, and the cause is far from clear. This work sheds light on the mechanisms underlying overfitting through analyzing the loss landscape w.r.t. the input. We find that robust overfitting results from standard training, specifically the minimization of the clean loss, and can be mitigated by regularization of the loss gradients. Moreover, we find that robust overfitting turns severer during adversarial training partially because the gradient regularization effect of adversarial training becomes weaker due to the increase in the loss landscapes curvature. To improve robust generalization, we propose a new regularizer to smooth the loss landscape by penalizing the weighted logits variation along the adversarial direction. Our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

treelli/combating-ro-advlc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.