Explain to Not Forget: Defending Against Catastrophic Forgetting with XAI
Sami Ede, Serop Baghdadlian, Leander Weber, An Nguyen, Dario Zanca,, Wojciech Samek, Sebastian Lapuschkin

TL;DR
This paper introduces a novel training algorithm that uses explainability techniques to prevent catastrophic forgetting in neural networks, enabling continual learning with improved resource efficiency.
Contribution
The paper presents 'training by explaining,' a new method leveraging Layer-wise Relevance Propagation to retain previous knowledge during training on new data.
Findings
Successfully retains old task knowledge in neural networks
More resource-efficient than existing state-of-the-art methods
Effective on various benchmark and complex datasets
Abstract
The ability to continuously process and retain new information like we do naturally as humans is a feat that is highly sought after when training neural networks. Unfortunately, the traditional optimization algorithms often require large amounts of data available during training time and updates wrt. new data are difficult after the training process has been completed. In fact, when new data or tasks arise, previous progress may be lost as neural networks are prone to catastrophic forgetting. Catastrophic forgetting describes the phenomenon when a neural network completely forgets previous knowledge when given new information. We propose a novel training algorithm called training by explaining in which we leverage Layer-wise Relevance Propagation in order to retain the information a neural network has already learned in previous tasks when training on new data. The method is evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
