LSEBMCL: A Latent Space Energy-Based Model for Continual Learning
Xiaodi Li, Dingcheng Li, Rujun Gao, Mahmoud Zamani, and Latifur Khan

TL;DR
This paper introduces LSEBMCL, a novel energy-based model that mitigates catastrophic forgetting in continual learning by sampling from previous tasks, demonstrating state-of-the-art results in NLP applications.
Contribution
It proposes using an energy-based model as an outer-generator to prevent forgetting, a new approach in continual learning for NLP tasks.
Findings
Achieves state-of-the-art results in NLP continual learning tasks.
Effectively prevents catastrophic forgetting using EBM sampling.
Demonstrates the efficacy of EBMs in practical NLP applications.
Abstract
Continual learning has become essential in many practical applications such as online news summaries and product classification. The primary challenge is known as catastrophic forgetting, a phenomenon where a model inadvertently discards previously learned knowledge when it is trained on new tasks. Existing solutions involve storing exemplars from previous classes, regularizing parameters during the fine-tuning process, or assigning different model parameters to each task. The proposed solution LSEBMCL (Latent Space Energy-Based Model for Continual Learning) in this work is to use energy-based models (EBMs) to prevent catastrophic forgetting by sampling data points from previous tasks when training on new ones. The EBM is a machine learning model that associates an energy value with each input data point. The proposed method uses an EBM layer as an outer-generator in the continual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning
Methodsenergy-based model
