Providing Error Detection for Deep Learning Image Classifiers Using Self-Explainability
Mohammad Mahdi Karimi, Azin Heidarshenas, William W. Edmonson

TL;DR
This paper introduces a self-explainable deep learning system for image classification that not only predicts classes but also explains its decisions and detects potential errors, enhancing safety in critical applications.
Contribution
The paper presents a novel SE-DL system that generates human-understandable explanations and leverages them for effective error detection in image classifiers.
Findings
The SE-DL system improves error detection accuracy.
Concept selection enhances error detection performance.
Compared schemes outperform non-explainable error detection methods.
Abstract
This paper proposes a self-explainable Deep Learning (SE-DL) system for an image classification problem that performs self-error detection. The self-error detection is key to improving the DL system's safe operation, especially in safety-critical applications such as automotive systems. A SE-DL system outputs both the class prediction and an explanation for that prediction, which provides insight into how the system makes its predictions for humans. Additionally, we leverage the explanation of the proposed SE-DL system to detect potential class prediction errors of the system. The proposed SE-DL system uses a set of concepts to generate the explanation. The concepts are human-understandable lower-level image features in each input image relevant to the higher-level class of that image. We present a concept selection methodology for scoring all concepts and selecting a subset of them…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
