Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer
Zhenrong Liu, Janne M. J. Huttunen, Mikko Honkala

TL;DR
This paper introduces two model compression frameworks using pruning and knowledge distillation to enable low-complexity inference in continual learning, especially for class-incremental tasks without task labels.
Contribution
It proposes novel, efficient frameworks for compressing large pre-trained models in continual learning, improving inference speed while maintaining accuracy.
Findings
Both frameworks outperform strong baselines in accuracy-efficiency trade-offs.
Pruning strategies effectively reduce model size with minimal accuracy loss.
Knowledge distillation transfers relevant knowledge to smaller models, enhancing efficiency.
Abstract
Continual learning (CL) aims to train models that can learn a sequence of tasks without forgetting previously acquired knowledge. A core challenge in CL is balancing stability -- preserving performance on old tasks -- and plasticity -- adapting to new ones. Recently, large pre-trained models have been widely adopted in CL for their ability to support both, offering strong generalization for new tasks and resilience against forgetting. However, their high computational cost at inference time limits their practicality in real-world applications, especially those requiring low latency or energy efficiency. To address this issue, we explore model compression techniques, including pruning and knowledge distillation (KD), and propose two efficient frameworks tailored for class-incremental learning (CIL), a challenging CL setting where task identities are unavailable during inference. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis
MethodsKnowledge Distillation · Pruning
