EXACFS -- A CIL Method to mitigate Catastrophic Forgetting
S Balasubramanian, M Sai Subramaniam, Sai Sriram Talasu, Yedu Krishna, P, Manepalli Pranav Phanindra Sai, Ravi Mukkamala, Darshan Gera

TL;DR
This paper proposes EXACFS, a novel class incremental learning method that uses feature significance estimation and distillation to reduce catastrophic forgetting in neural networks, showing improved stability and plasticity.
Contribution
Introduction of EXACFS, a new CIL approach that estimates feature significance via loss gradients and employs distillation to mitigate forgetting.
Findings
Outperforms existing methods on CIFAR-100 and ImageNet-100
Effectively balances stability and plasticity in continual learning
Reduces catastrophic forgetting significantly
Abstract
Deep neural networks (DNNS) excel at learning from static datasets but struggle with continual learning, where data arrives sequentially. Catastrophic forgetting, the phenomenon of forgetting previously learned knowledge, is a primary challenge. This paper introduces EXponentially Averaged Class-wise Feature Significance (EXACFS) to mitigate this issue in the class incremental learning (CIL) setting. By estimating the significance of model features for each learned class using loss gradients, gradually aging the significance through the incremental tasks and preserving the significant features through a distillation loss, EXACFS effectively balances remembering old knowledge (stability) and learning new knowledge (plasticity). Extensive experiments on CIFAR-100 and ImageNet-100 demonstrate EXACFS's superior performance in preserving stability while acquiring plasticity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Semantic Web and Ontologies · Domain Adaptation and Few-Shot Learning
