Exemplar-Free Continual Transformer with Convolutions
Anurag Roy, Vinay Kumar Verma, Sravan Voonna, Kripabandhu Ghosh,, Saptarshi Ghosh, Abir Das

TL;DR
This paper introduces ConTraCon, an exemplar-free continual learning method for vision transformers that re-weights attention weights with convolutions and predicts tasks without task IDs, outperforming existing methods.
Contribution
The paper presents a novel exemplar-free continual learning approach for vision transformers that re-weights attention layers with convolutions and predicts tasks without task IDs.
Findings
Outperforms several competitive approaches on benchmark datasets.
Requires fewer parameters than existing methods.
Effectively predicts tasks without explicit task IDs.
Abstract
Continual Learning (CL) involves training a machine learning model in a sequential manner to learn new information while retaining previously learned tasks without the presence of previous training data. Although there has been significant interest in CL, most recent CL approaches in computer vision have focused on convolutional architectures only. However, with the recent success of vision transformers, there is a need to explore their potential for CL. Although there have been some recent CL approaches for vision transformers, they either store training instances of previous tasks or require a task identifier during test time, which can be limiting. This paper proposes a new exemplar-free approach for class/task incremental learning called ConTraCon, which does not require task-id to be explicitly present during inference and avoids the need for storing previous training instances.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Exemplar-Free Continual Transformer with Convolutions· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications
