HINT: Hypernetwork Approach to Training Weight Interval Regions in Continual Learning
Patryk Krukowski, Anna Bielawska, Kamil Ksi\k{a}\.zek, Pawe{\l}, Wawrzy\'nski, Pawe{\l} Batorski, Przemys{\l}aw Spurek

TL;DR
HINT introduces a hypernetwork-based method that uses interval arithmetic in a lower-dimensional embedding space to improve training efficiency and prevent forgetting in continual learning, achieving state-of-the-art results.
Contribution
The paper proposes HINT, a novel approach combining interval arithmetic and hypernetworks to manage weight intervals in a lower-dimensional space for continual learning.
Findings
HINT outperforms InterContiNet on multiple benchmarks.
HINT achieves faster and more efficient training.
HINT maintains task performance without forgetting.
Abstract
Recently, a new Continual Learning (CL) paradigm was presented to control catastrophic forgetting, called Interval Continual Learning (InterContiNet), which relies on enforcing interval constraints on the neural network parameter space. Unfortunately, InterContiNet training is challenging due to the high dimensionality of the weight space, making intervals difficult to manage. To address this issue, we introduce HINT, a technique that employs interval arithmetic within the embedding space and utilizes a hypernetwork to map these intervals to the target network parameter space. We train interval embeddings for consecutive tasks and train a hypernetwork to transform these embeddings into weights of the target network. An embedding for a given task is trained along with the hypernetwork, preserving the response of the target network for the previous task embeddings. Interval arithmetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning
MethodsHierarchical Information Threading · Sparse Evolutionary Training · HyperNetwork
