Continual Learning with Guarantees via Weight Interval Constraints

Maciej Wo{\l}czyk; Karol J. Piczak; Bartosz W\'ojcik; {\L}ukasz; Pustelnik; Pawe{\l} Morawiecki; Jacek Tabor; Tomasz Trzci\'nski,; Przemys{\l}aw Spurek

arXiv:2206.07996·cs.LG·June 17, 2022·6 cites

Continual Learning with Guarantees via Weight Interval Constraints

Maciej Wo{\l}czyk, Karol J. Piczak, Bartosz W\'ojcik, {\L}ukasz, Pustelnik, Pawe{\l} Morawiecki, Jacek Tabor, Tomasz Trzci\'nski,, Przemys{\l}aw Spurek

PDF

Open Access 3 Repos

TL;DR

This paper proposes Hyperrectangle Training, a novel continual learning method that uses interval constraints on neural network parameters to guarantee bounded forgetting and improve resilience without storing past data.

Contribution

It introduces a new training paradigm that enforces hyperrectangle constraints on parameters, transforming the continual learning problem into a polynomial-time, forgetting-resilient process.

Findings

01

The InterContiNet algorithm effectively models parameter regions with interval arithmetic.

02

The approach provides full resilience against forgetting in continual learning tasks.

03

Experimental results show competitive performance without storing previous data.

Abstract

We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of data, while reducing the negative impact of catastrophic forgetting, yet they do not provide any firm guarantees that network performance will not deteriorate uncontrollably over time. In this work, we show how to put bounds on forgetting by reformulating continual learning of a model as a continual contraction of its parameter space. To that end, we propose Hyperrectangle Training, a new training methodology where each task is represented by a hyperrectangle in the parameter space, fully contained in the hyperrectangles of the previous tasks. This formulation reduces the NP-hard CL problem back to polynomial time while providing full resilience against…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications