HINT: Hypernetwork Approach to Training Weight Interval Regions in   Continual Learning

Patryk Krukowski; Anna Bielawska; Kamil Ksi\k{a}\.zek; Pawe{\l}; Wawrzy\'nski; Pawe{\l} Batorski; Przemys{\l}aw Spurek

arXiv:2405.15444·cs.LG·May 7, 2025

HINT: Hypernetwork Approach to Training Weight Interval Regions in Continual Learning

Patryk Krukowski, Anna Bielawska, Kamil Ksi\k{a}\.zek, Pawe{\l}, Wawrzy\'nski, Pawe{\l} Batorski, Przemys{\l}aw Spurek

PDF

Open Access 2 Repos

TL;DR

HINT introduces a hypernetwork-based method that uses interval arithmetic in a lower-dimensional embedding space to improve training efficiency and prevent forgetting in continual learning, achieving state-of-the-art results.

Contribution

The paper proposes HINT, a novel approach combining interval arithmetic and hypernetworks to manage weight intervals in a lower-dimensional space for continual learning.

Findings

01

HINT outperforms InterContiNet on multiple benchmarks.

02

HINT achieves faster and more efficient training.

03

HINT maintains task performance without forgetting.

Abstract

Recently, a new Continual Learning (CL) paradigm was presented to control catastrophic forgetting, called Interval Continual Learning (InterContiNet), which relies on enforcing interval constraints on the neural network parameter space. Unfortunately, InterContiNet training is challenging due to the high dimensionality of the weight space, making intervals difficult to manage. To address this issue, we introduce HINT, a technique that employs interval arithmetic within the embedding space and utilizes a hypernetwork to map these intervals to the target network parameter space. We train interval embeddings for consecutive tasks and train a hypernetwork to transform these embeddings into weights of the target network. An embedding for a given task is trained along with the hypernetwork, preserving the response of the target network for the previous task embeddings. Interval arithmetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning

MethodsHierarchical Information Threading · Sparse Evolutionary Training · HyperNetwork