Temporal-Difference Variational Continual Learning
Luckeciano C. Melo, Alessandro Abate, Yarin Gal

TL;DR
This paper introduces a novel variational continual learning method that integrates multiple past posterior estimates to reduce approximation errors and employs Temporal-Difference concepts, significantly improving knowledge retention in sequential learning tasks.
Contribution
It proposes new learning objectives combining past posterior regularization with TD-inspired techniques, addressing catastrophic forgetting more effectively than existing variational methods.
Findings
Outperforms existing Variational CL methods on benchmarks
Effectively mitigates catastrophic forgetting
Connects variational updates with Temporal-Difference learning
Abstract
Machine Learning models in real-world applications must continuously learn new tasks to adapt to shifts in the data-generating distribution. Yet, for Continual Learning (CL), models often struggle to balance learning new tasks (plasticity) with retaining previous knowledge (memory stability). Consequently, they are susceptible to Catastrophic Forgetting, which degrades performance and undermines the reliability of deployed systems. In the Bayesian CL literature, variational methods tackle this challenge by employing a learning objective that recursively updates the posterior distribution while constraining it to stay close to its previous estimate. Nonetheless, we argue that these methods may be ineffective due to compounding approximation errors over successive recursions. To mitigate this, we propose new learning objectives that integrate the regularization effects of multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing
