Overcoming Catastrophic Forgetting by XAI
Giang Nguyen

TL;DR
This paper introduces CFD, a tool for explaining catastrophic forgetting in neural networks, and proposes Critical Freezing, a new continual learning method that improves performance and enhances explainability.
Contribution
The work presents CFD for interpreting forgetting mechanisms and introduces Critical Freezing, a novel approach that outperforms existing methods in continual learning.
Findings
CFD reveals which network components forget during training.
Critical Freezing significantly reduces catastrophic forgetting.
The proposed method outperforms recent continual learning techniques.
Abstract
Explaining the behaviors of deep neural networks, usually considered as black boxes, is critical especially when they are now being adopted over diverse aspects of human life. Taking the advantages of interpretable machine learning (interpretable ML), this work proposes a novel tool called Catastrophic Forgetting Dissector (or CFD) to explain catastrophic forgetting in continual learning settings. We also introduce a new method called Critical Freezing based on the observations of our tool. Experiments on ResNet articulate how catastrophic forgetting happens, particularly showing which components of this famous network are forgetting. Our new continual learning algorithm defeats various recent techniques by a significant margin, proving the capability of the investigation. Critical freezing not only attacks catastrophic forgetting but also exposes explainability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · 1x1 Convolution · Residual Connection · Batch Normalization · Kaiming Initialization · Global Average Pooling · Bottleneck Residual Block · Max Pooling · Residual Block
