Anytime Continual Learning for Open Vocabulary Classification
Zhen Zhu, Yiming Gong, Derek Hoiem

TL;DR
This paper introduces AnytimeCL, a continual learning approach for open vocabulary image classification that allows models to predict any labels at any time and efficiently update with new data, outperforming recent methods.
Contribution
The paper presents a dynamic weighting scheme and an attention-weighted PCA compression to enable flexible, efficient continual learning for open vocabulary classification.
Findings
Significant improvements over recent methods in continual learning performance.
Effective feature compression with minimal accuracy loss.
Enhanced flexibility in learning and inference for open vocabulary tasks.
Abstract
We propose an approach for anytime continual learning (AnytimeCL) for open vocabulary image classification. The AnytimeCL problem aims to break away from batch training and rigid models by requiring that a system can predict any set of labels at any time and efficiently update and improve when receiving one or more training samples at any time. Despite the challenging goal, we achieve substantial improvements over recent methods. We propose a dynamic weighting between predictions of a partially fine-tuned model and a fixed open vocabulary model that enables continual improvement when training samples are available for a subset of a task's labels. We also propose an attention-weighted PCA compression of training features that reduces storage and computation with little impact to model accuracy. Our methods are validated with experiments that test flexibility of learning and inference.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques
MethodsSparse Evolutionary Training · Principal Components Analysis
