Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization
Jiaxin Deng, Junbiao Pang, Baochang Zhang

TL;DR
This paper introduces AUSAM, a probabilistic sampling method that accelerates Sharpness-Aware Minimization by reducing computational cost while maintaining generalization performance across various tasks and models.
Contribution
AUSAM is a novel, architecture-agnostic sampling technique that approximates gradient norms to speed up SAM without sacrificing accuracy.
Findings
AUSAM achieves over 70% speedup on CIFAR datasets.
AUSAM maintains comparable accuracy to SAM across tasks.
AUSAM outperforms recent data pruning methods in efficiency and performance.
Abstract
Sharpness-Aware Minimization (SAM) has emerged as a promising approach for effectively reducing the generalization error. However, SAM incurs twice the computational cost compared to base optimizer (e.g., SGD). We propose Asymptotic Unbiased Sampling with respect to iterations to accelerate SAM (AUSAM), which maintains the model's generalization capacity while significantly enhancing computational efficiency. Concretely, we probabilistically sample a subset of data points beneficial for SAM optimization based on a theoretically guaranteed criterion, i.e., the Gradient Norm of each Sample (GNS). We further approximate the GNS by the difference in loss values before and after perturbation in SAM. As a plug-and-play, architecture-agnostic method, our approach consistently accelerates SAM across a range of tasks and networks, i.e., classification, human pose estimation and network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Cell Image Analysis Techniques · Industrial Vision Systems and Defect Detection
MethodsBalanced Selection · Graph Network-based Simulators · Segment Anything Model · Pruning
