MGSER-SAM: Memory-Guided Soft Experience Replay with Sharpness-Aware Optimization for Enhanced Continual Learning
Xingyu Li, Bo Tang

TL;DR
MGSER-SAM introduces a memory replay-based continual learning algorithm that combines sharpness-aware optimization with strategies to mitigate task interference, significantly improving accuracy and reducing forgetting across benchmarks.
Contribution
It integrates the SAM optimizer into experience replay frameworks and addresses task conflict issues with soft logits and gradient alignment, advancing continual learning methods.
Findings
Outperforms baselines ER and DER++ in accuracy by 24.4% and 17.6%.
Achieves the lowest forgetting across multiple benchmarks.
Demonstrates consistent improvements in three continual learning scenarios.
Abstract
Deep neural networks suffer from the catastrophic forgetting problem in the field of continual learning (CL). To address this challenge, we propose MGSER-SAM, a novel memory replay-based algorithm specifically engineered to enhance the generalization capabilities of CL models. We first intergrate the SAM optimizer, a component designed for optimizing flatness, which seamlessly fits into well-known Experience Replay frameworks such as ER and DER++. Then, MGSER-SAM distinctively addresses the complex challenge of reconciling conflicts in weight perturbation directions between ongoing tasks and previously stored memories, which is underexplored in the SAM optimizer. This is effectively accomplished by the strategic integration of soft logits and the alignment of memory gradient directions, where the regularization terms facilitate the concurrent minimization of various training loss terms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Seismology and Earthquake Studies
MethodsExperience Replay · Segment Anything Model
