Simplifying the explanation of deep neural networks with sufficient and necessary feature-sets: case of text classification
Jiechieu Kameni Florentin Flambeau, Tsopze Norbert

TL;DR
This paper introduces a method to simplify explanations of deep neural network decisions by identifying minimal feature-sets that are sufficient and necessary, enhancing interpretability especially in text classification tasks.
Contribution
It proposes a novel approach for extracting sufficient and necessary feature-sets for explaining 1D CNN decisions and adapts Layer-wise Relevance Propagation for this purpose.
Findings
Relevance distribution aligns with state-of-the-art models.
Extracted features are perceptually convincing to humans.
Method improves interpretability of CNN decisions.
Abstract
During the last decade, deep neural networks (DNN) have demonstrated impressive performances solving a wide range of problems in various domains such as medicine, finance, law, etc. Despite their great performances, they have long been considered as black-box systems, providing good results without being able to explain them. However, the inability to explain a system decision presents a serious risk in critical domains such as medicine where people's lives are at stake. Several works have been done to uncover the inner reasoning of deep neural networks. Saliency methods explain model decisions by assigning weights to input features that reflect their contribution to the classifier decision. However, not all features are necessary to explain a model decision. In practice, classifiers might strongly rely on a subset of features that might be sufficient to explain a particular decision.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
