Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer

Zhenrong Liu; Janne M. J. Huttunen; Mikko Honkala

arXiv:2505.08327·cs.LG·May 14, 2025

Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer

Zhenrong Liu, Janne M. J. Huttunen, Mikko Honkala

PDF

Open Access

TL;DR

This paper introduces two model compression frameworks using pruning and knowledge distillation to enable low-complexity inference in continual learning, especially for class-incremental tasks without task labels.

Contribution

It proposes novel, efficient frameworks for compressing large pre-trained models in continual learning, improving inference speed while maintaining accuracy.

Findings

01

Both frameworks outperform strong baselines in accuracy-efficiency trade-offs.

02

Pruning strategies effectively reduce model size with minimal accuracy loss.

03

Knowledge distillation transfers relevant knowledge to smaller models, enhancing efficiency.

Abstract

Continual learning (CL) aims to train models that can learn a sequence of tasks without forgetting previously acquired knowledge. A core challenge in CL is balancing stability -- preserving performance on old tasks -- and plasticity -- adapting to new ones. Recently, large pre-trained models have been widely adopted in CL for their ability to support both, offering strong generalization for new tasks and resilience against forgetting. However, their high computational cost at inference time limits their practicality in real-world applications, especially those requiring low latency or energy efficiency. To address this issue, we explore model compression techniques, including pruning and knowledge distillation (KD), and propose two efficient frameworks tailored for class-incremental learning (CIL), a challenging CL setting where task identities are unavailable during inference. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis

MethodsKnowledge Distillation · Pruning