Pushing the Limits of Distillation-Based Continual Learning via Classifier-Proximal Lightweight Plugins

Zhiming Xu; Baile Xu; Jian Zhao; Furao Shen; Suorong Yang

arXiv:2512.03537·cs.LG·April 6, 2026

Pushing the Limits of Distillation-Based Continual Learning via Classifier-Proximal Lightweight Plugins

Zhiming Xu, Baile Xu, Jian Zhao, Furao Shen, Suorong Yang

PDF

TL;DR

This paper introduces DLC, a plugin-based method for distillation-based continual learning that improves accuracy with minimal additional parameters by using lightweight residual plugins and importance weighting.

Contribution

The paper proposes a novel plugin extension paradigm called DLC that enhances distillation-based continual learning through classifier-proximal lightweight residual plugins.

Findings

01

DLC achieves an 8% accuracy improvement on large-scale benchmarks.

02

DLC adds only 4% more parameters to the backbone.

03

DLC is compatible with other continual learning enhancements.

Abstract

Continual learning requires models to learn continuously while preserving prior knowledge under evolving data streams. Distillation-based methods are appealing for retaining past knowledge in a shared single-model framework with low storage overhead. However, they remain constrained by the stability-plasticity dilemma: knowledge acquisition and preservation are still optimized through coupled objectives, and existing enhancement methods do not alter this underlying bottleneck. To address this issue, we propose a plugin extension paradigm termed Distillation-aware Lightweight Components (DLC) for distillation-based CL. DLC deploys lightweight residual plugins into the base feature extractor's classifier-proximal layer, enabling semantic-level residual correction for better classification accuracy while minimizing disruption to the overall feature extraction process. During inference,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.