Enhancing Pretrained Model-based Continual Representation Learning via Guided Random Projection
Ruilin Li, Heming Zou, Xiufeng Yan, Zheming Liang, Jie Yang, Chenliang Li, Xue Yang

TL;DR
This paper introduces SCL-MGSM, a data-guided method for constructing a more expressive and stable random projection layer in continual learning, significantly improving performance over existing random projection approaches.
Contribution
It proposes a novel data-guided projection layer construction mechanism, enhancing expressivity and stability in continual learning with pre-trained models.
Findings
Outperforms state-of-the-art methods on CIL benchmarks
Achieves more stable and expressive feature representations
Reduces negative effects of domain gaps in continual learning
Abstract
Recent paradigms in Random Projection Layer (RPL)-based continual representation learning have demonstrated superior performance when building upon a pre-trained model (PTM). These methods insert a randomly initialized RPL after a PTM to enhance feature representation in the initial stage. Subsequently, a linear classification head is used for analytic updates in the continual learning stage. However, under severe domain gaps between pre-trained representations and target domains, a randomly initialized RPL exhibits limited expressivity under large domain shifts. While largely scaling up the RPL dimension can improve expressivity, it also induces an ill-conditioned feature matrix, thereby destabilizing the recursive analytic updates of the linear head. To this end, we propose the Stochastic Continual Learner with MemoryGuard Supervisory Mechanism (SCL-MGSM). Unlike random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis
