Continual Learning with Guarantees via Weight Interval Constraints
Maciej Wo{\l}czyk, Karol J. Piczak, Bartosz W\'ojcik, {\L}ukasz, Pustelnik, Pawe{\l} Morawiecki, Jacek Tabor, Tomasz Trzci\'nski,, Przemys{\l}aw Spurek

TL;DR
This paper proposes Hyperrectangle Training, a novel continual learning method that uses interval constraints on neural network parameters to guarantee bounded forgetting and improve resilience without storing past data.
Contribution
It introduces a new training paradigm that enforces hyperrectangle constraints on parameters, transforming the continual learning problem into a polynomial-time, forgetting-resilient process.
Findings
The InterContiNet algorithm effectively models parameter regions with interval arithmetic.
The approach provides full resilience against forgetting in continual learning tasks.
Experimental results show competitive performance without storing previous data.
Abstract
We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of data, while reducing the negative impact of catastrophic forgetting, yet they do not provide any firm guarantees that network performance will not deteriorate uncontrollably over time. In this work, we show how to put bounds on forgetting by reformulating continual learning of a model as a continual contraction of its parameter space. To that end, we propose Hyperrectangle Training, a new training methodology where each task is represented by a hyperrectangle in the parameter space, fully contained in the hyperrectangles of the previous tasks. This formulation reduces the NP-hard CL problem back to polynomial time while providing full resilience against…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
