Temporal-Difference Variational Continual Learning

Luckeciano C. Melo; Alessandro Abate; Yarin Gal

arXiv:2410.07812·cs.LG·October 24, 2025

Temporal-Difference Variational Continual Learning

Luckeciano C. Melo, Alessandro Abate, Yarin Gal

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel variational continual learning method that integrates multiple past posterior estimates to reduce approximation errors and employs Temporal-Difference concepts, significantly improving knowledge retention in sequential learning tasks.

Contribution

It proposes new learning objectives combining past posterior regularization with TD-inspired techniques, addressing catastrophic forgetting more effectively than existing variational methods.

Findings

01

Outperforms existing Variational CL methods on benchmarks

02

Effectively mitigates catastrophic forgetting

03

Connects variational updates with Temporal-Difference learning

Abstract

Machine Learning models in real-world applications must continuously learn new tasks to adapt to shifts in the data-generating distribution. Yet, for Continual Learning (CL), models often struggle to balance learning new tasks (plasticity) with retaining previous knowledge (memory stability). Consequently, they are susceptible to Catastrophic Forgetting, which degrades performance and undermines the reliability of deployed systems. In the Bayesian CL literature, variational methods tackle this challenge by employing a learning objective that recursively updates the posterior distribution while constraining it to stay close to its previous estimate. Nonetheless, we argue that these methods may be ineffective due to compounding approximation errors over successive recursions. To mitigate this, we propose new learning objectives that integrate the regularization effects of multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luckeciano/TD-VCL
pytorchOfficial

Videos

Temporal-Difference Variational Continual Learning· slideslive

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing