Dynamic Continual Learning: Harnessing Parameter Uncertainty for Improved Network Adaptation
Christopher Angelini, Nidhal Bouaynaya

TL;DR
This paper introduces a novel continual learning method that uses parameter uncertainty to identify and protect important network parameters during fine-tuning, thereby reducing forgetting and improving performance across tasks.
Contribution
It proposes a Bayesian framework that learns parameter uncertainties and incorporates them into regularization strategies, enhancing network adaptation in continual learning scenarios.
Findings
Improved average test accuracy over existing methods.
Enhanced backward transfer metrics.
Effective parameter importance estimation through uncertainty modeling.
Abstract
When fine-tuning Deep Neural Networks (DNNs) to new data, DNNs are prone to overwriting network parameters required for task-specific functionality on previously learned tasks, resulting in a loss of performance on those tasks. We propose using parameter-based uncertainty to determine which parameters are relevant to a network's learned function and regularize training to prevent change in these important parameters. We approach this regularization in two ways: (1), we constrain critical parameters from significant changes by associating more critical parameters with lower learning rates, thereby limiting alterations in those parameters; (2), important parameters are restricted from change by imposing a higher regularization weighting, causing parameters to revert to their states prior to the learning of subsequent tasks. We leverage a Bayesian Moment Propagation framework which learns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
