Class incremental learning with probability dampening and cascaded gated classifier
Jary Pomponi, Alessio Devoto, Simone Scardapane

TL;DR
This paper introduces a novel regularization method and a cascaded gated classifier to improve continual learning in neural networks, reducing forgetting without extensive replay buffers.
Contribution
It proposes Margin Dampening and Cascaded Scaling Classifier, novel techniques that enhance knowledge retention and task adaptation in class incremental learning.
Findings
Outperforms baseline methods on multiple benchmarks.
Effectively mitigates forgetting without large memory buffers.
Analyzes component contributions to overall performance.
Abstract
Humans are capable of acquiring new knowledge and transferring learned knowledge into different domains while incurring little forgetting. The same ability, called Continual Learning, is challenging to achieve with neural networks, because learning new tasks causes forgetting of previously learned ones. This forgetting can be mitigated by replaying stored samples from past tasks, but a large memory size may be needed for long sequences of tasks; moreover, this could lead to overfitting on saved samples. In this paper, we propose a novel regularisation approach and a novel incremental classifier called, respectively, Margin Dampening and Cascaded Scaling Classifier. The first combines a soft constraint and a knowledge distillation approach to preserve past learned knowledge while allowing the model to learn new patterns effectively. The latter is a gated incremental…
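The abstract names knowledge distillation as one ingredient of Margin Dampening. As a rough illustration only, the sketch below shows a generic temperature-scaled distillation term of the kind commonly used in continual learning. It is not the paper's Margin Dampening loss: the function names, the temperature value, and the weight `lam` are all assumptions introduced here for the example.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of floats."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(old_logits, new_logits, temperature=2.0):
    """KL(teacher || student) over the classes seen in past tasks.

    old_logits: outputs of the frozen model from the previous task (teacher).
    new_logits: outputs of the model being trained, restricted to the
                same past classes (student).
    """
    p = softmax(old_logits, temperature)
    q = softmax(new_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def total_loss(new_task_loss, old_logits, new_logits, lam=1.0):
    """Hypothetical combined objective: new-task loss plus a weighted
    distillation term (lam is an assumed trade-off weight)."""
    return new_task_loss + lam * distillation_loss(old_logits, new_logits)
```

The distillation term is zero when the new model reproduces the old model's outputs on past classes and grows as the two diverge, which is the general sense in which such a term preserves past knowledge.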
Taxonomy
Topics: Face and Expression Recognition · Anomaly Detection Techniques and Applications · Text and Document Classification Technologies
Methods: Knowledge Distillation
