TL;DR
MePo introduces a meta post-refinement approach leveraging bi-level meta-learning and second-order statistics to enhance rehearsal-free continual learning with pretrained models.
Contribution
It proposes a novel meta post-refinement method that refines pretrained models for better general continual learning without rehearsal.
Findings
Significant performance improvements on multiple GCL benchmarks.
Effective utilization of second-order statistics for robust output alignment.
Plug-in strategy compatible with various pretrained checkpoints.
Abstract
To cope with uncertain changes of the external world, intelligent systems must continually learn from complex, evolving environments and respond in real time. This ability, collectively known as general continual learning (GCL), encapsulates practical challenges such as online datastreams and blurry task boundaries. Although leveraging pretrained models (PTMs) has greatly advanced conventional continual learning (CL), these methods remain limited in reconciling the diverse and temporally mixed information along a single pass, resulting in sub-optimal GCL performance. Inspired by meta-plasticity and reconstructive memory in neuroscience, we introduce here an innovative approach named Meta Post-Refinement (MePo) for PTMs-based GCL. This approach constructs pseudo task sequences from pretraining data and develops a bi-level meta-learning paradigm to refine the pretrained backbone, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
