Dynamic Continual Learning: Harnessing Parameter Uncertainty for   Improved Network Adaptation

Christopher Angelini; Nidhal Bouaynaya

arXiv:2501.10861·cs.LG·January 22, 2025

Dynamic Continual Learning: Harnessing Parameter Uncertainty for Improved Network Adaptation

Christopher Angelini, Nidhal Bouaynaya

PDF

TL;DR

This paper introduces a novel continual learning method that uses parameter uncertainty to identify and protect important network parameters during fine-tuning, thereby reducing forgetting and improving performance across tasks.

Contribution

It proposes a Bayesian framework that learns parameter uncertainties and incorporates them into regularization strategies, enhancing network adaptation in continual learning scenarios.

Findings

01

Improved average test accuracy over existing methods.

02

Enhanced backward transfer metrics.

03

Effective parameter importance estimation through uncertainty modeling.

Abstract

When fine-tuning Deep Neural Networks (DNNs) to new data, DNNs are prone to overwriting network parameters required for task-specific functionality on previously learned tasks, resulting in a loss of performance on those tasks. We propose using parameter-based uncertainty to determine which parameters are relevant to a network's learned function and regularize training to prevent change in these important parameters. We approach this regularization in two ways: (1), we constrain critical parameters from significant changes by associating more critical parameters with lower learning rates, thereby limiting alterations in those parameters; (2), important parameters are restricted from change by imposing a higher regularization weighting, causing parameters to revert to their states prior to the learning of subsequent tasks. We leverage a Bayesian Moment Propagation framework which learns…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.