Enhancing Visual Continual Learning with Language-Guided Supervision
Bolin Ni, Hongbo Zhao, Chenghao Zhang, Ke Hu, Gaofeng Meng, Zhaoxiang, Zhang, Shiming Xiang

TL;DR
This paper introduces a novel approach to continual learning by replacing traditional classifier heads with semantic supervision from pretrained language models, improving knowledge transfer and reducing forgetting.
Contribution
It proposes using PLMs to generate semantic class targets, enhancing semantic understanding and transfer in continual visual learning tasks.
Findings
Improves Top-1 accuracy on ImageNet-100 by up to 6.1%.
Reduces forgetting rate significantly across multiple benchmarks.
Seamlessly integrates with existing continual learning methods.
Abstract
Continual learning (CL) aims to empower models to learn new tasks without forgetting previously acquired knowledge. Most prior works concentrate on the techniques of architectures, replay data, regularization, \etc. However, the category name of each class is largely neglected. Existing methods commonly utilize the one-hot labels and randomly initialize the classifier head. We argue that the scarce semantic information conveyed by the one-hot labels hampers the effective knowledge transfer across tasks. In this paper, we revisit the role of the classifier head within the CL paradigm and replace the classifier with semantic knowledge from pretrained language models (PLMs). Specifically, we use PLMs to generate semantic targets for each class, which are frozen and serve as supervision signals during training. Such targets fully consider the semantic correlation between all classes across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReflective Practices in Education
