Lookbehind-SAM: k steps back, 1 step forward
Gon\c{c}alo Mordido, Pranshu Malviya, Aristide Baratin, Sarath Chandar

TL;DR
Lookbehind-SAM introduces multiple ascent steps behind the current point to better identify worst-case perturbations, improving generalization, robustness, and lifelong learning performance.
Contribution
It proposes a novel Lookbehind method that enhances SAM by performing multiple ascent steps behind, inspired by Lookahead, to improve loss-sharpness trade-offs.
Findings
Enhanced generalization performance across tasks
Increased robustness against noisy weights
Improved learning and reduced catastrophic forgetting
Abstract
Sharpness-aware minimization (SAM) methods have gained increasing popularity by formulating the problem of minimizing both loss value and loss sharpness as a minimax objective. In this work, we increase the efficiency of the maximization and minimization parts of SAM's objective to achieve a better loss-sharpness trade-off. By taking inspiration from the Lookahead optimizer, which uses multiple descent steps ahead, we propose Lookbehind, which performs multiple ascent steps behind to enhance the maximization step of SAM and find a worst-case perturbation with higher loss. Then, to mitigate the variance in the descent step arising from the gathered gradients across the multiple ascent steps, we employ linear interpolation to refine the minimization step. Lookbehind leads to a myriad of benefits across a variety of tasks. Particularly, we show increased generalization performance, greater…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Image Enhancement Techniques
MethodsSegment Anything Model · Sharpness-Aware Minimization · Lookahead
