Exploiting Task Relationships in Continual Learning via Transferability-Aware Task Embeddings

Yanru Wu; Jianning Wang; Xiangyu Chen; Enming Zhang; Yang Tan; Hanbing Liu; Yang Li

arXiv:2502.11609·cs.LG·January 15, 2026

Exploiting Task Relationships in Continual Learning via Transferability-Aware Task Embeddings

Yanru Wu, Jianning Wang, Xiangyu Chen, Enming Zhang, Yang Tan, Hanbing Liu, Yang Li

PDF

Open Access

TL;DR

This paper introduces a transferability-aware task embedding called H-embedding, integrated into a hypernet framework, to improve continual learning by effectively leveraging inter-task relationships and enhancing transfer.

Contribution

It proposes a novel H-embedding derived from information theory, enabling online, low-dimensional, and efficient task-conditioned model learning for continual learning.

Findings

01

Outperforms baseline and SOTA methods on CIFAR-100, ImageNet-R, and DomainNet.

02

Efficiently captures intrinsic task relationships with low storage overhead.

03

Supports end-to-end training with practical computational requirements.

Abstract

Continual learning (CL) has been a critical topic in contemporary deep neural network applications, where higher levels of both forward and backward transfer are desirable for an effective CL performance. Existing CL strategies primarily focus on task models, either by regularizing model updates or by separating task-specific and shared components, while often overlooking the potential of leveraging inter-task relationships to enhance transfer. To address this gap, we propose a transferability-aware task embedding, termed H-embedding, and construct a hypernet framework under its guidance to learn task-conditioned model weights for CL tasks. Specifically, H-embedding is derived from an information theoretic measure of transferability and is designed to be online and easy to compute. Our method is also characterized by notable practicality, requiring only the storage of a low-dimensional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsFocus