Dependency Decomposition and a Reject Option for Explainable Models
Jan Kronenberger, Anselm Haselhoff

TL;DR
This paper introduces Explanation Dependency Decomposition (EDD) to analyze dependencies between classification outputs and explanations in deep learning, proposing methods to generate explanations and a reject option for verification, enhancing explainability and robustness.
Contribution
It is the first to analyze dependencies between classification probabilities and explanations, proposing EDD and methods for explanation generation and prediction verification.
Findings
EDD reveals dependencies between outputs and explanations
Proposed explanation methods improve interpretability
Reject option helps verify model predictions
Abstract
Deploying machine learning models in safety-related do-mains (e.g. autonomous driving, medical diagnosis) demands for approaches that are explainable, robust against adversarial attacks and aware of the model uncertainty. Recent deep learning models perform extremely well in various inference tasks, but the black-box nature of these approaches leads to a weakness regarding the three requirements mentioned above. Recent advances offer methods to visualize features, describe attribution of the input (e.g.heatmaps), provide textual explanations or reduce dimensionality. However,are explanations for classification tasks dependent or are they independent of each other? For in-stance, is the shape of an object dependent on the color? What is the effect of using the predicted class for generating explanations and vice versa? In the context of explainable deep learning models, we present the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
MethodsAttentive Walk-Aggregating Graph Neural Network
