Variational Density Propagation Continual Learning
Christopher Angelini, Nidhal Bouaynaya, and Ghulam Rasool

TL;DR
This paper introduces a variational density propagation method for continual learning that reduces catastrophic forgetting by propagating mean and covariance through neural networks, eliminating the need for Monte Carlo sampling.
Contribution
It develops a closed-form ELBO approach that approximates the predictive distribution and applies the MDL principle to minimize model complexity across tasks.
Findings
Effective mitigation of catastrophic forgetting in continual learning.
Elimination of Monte Carlo sampling in Bayesian inference.
Minimal network complexity over multiple tasks.
Abstract
Deep Neural Networks (DNNs) deployed to the real world are regularly subject to out-of-distribution (OoD) data, various types of noise, and shifting conceptual objectives. This paper proposes a framework for adapting to data distribution drift modeled by benchmark Continual Learning datasets. We develop and evaluate a method of Continual Learning that leverages uncertainty quantification from Bayesian Inference to mitigate catastrophic forgetting. We expand on previous approaches by removing the need for Monte Carlo sampling of the model weights to sample the predictive distribution. We optimize a closed-form Evidence Lower Bound (ELBO) objective approximating the predictive distribution by propagating the first two moments of a distribution, i.e. mean and covariance, through all network layers. Catastrophic forgetting is mitigated by using the closed-form ELBO to approximate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
MethodsMinimum Description Length
