Exploiting Task Relationships in Continual Learning via Transferability-Aware Task Embeddings
Yanru Wu, Jianning Wang, Xiangyu Chen, Enming Zhang, Yang Tan, Hanbing Liu, Yang Li

TL;DR
This paper introduces a transferability-aware task embedding called H-embedding, integrated into a hypernet framework, to improve continual learning by effectively leveraging inter-task relationships and enhancing transfer.
Contribution
It proposes a novel H-embedding derived from information theory, enabling online, low-dimensional, and efficient task-conditioned model learning for continual learning.
Findings
Outperforms baseline and SOTA methods on CIFAR-100, ImageNet-R, and DomainNet.
Efficiently captures intrinsic task relationships with low storage overhead.
Supports end-to-end training with practical computational requirements.
Abstract
Continual learning (CL) has been a critical topic in contemporary deep neural network applications, where higher levels of both forward and backward transfer are desirable for an effective CL performance. Existing CL strategies primarily focus on task models, either by regularizing model updates or by separating task-specific and shared components, while often overlooking the potential of leveraging inter-task relationships to enhance transfer. To address this gap, we propose a transferability-aware task embedding, termed H-embedding, and construct a hypernet framework under its guidance to learn task-conditioned model weights for CL tasks. Specifically, H-embedding is derived from an information theoretic measure of transferability and is designed to be online and easy to compute. Our method is also characterized by notable practicality, requiring only the storage of a low-dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsFocus
