DiConStruct: Causal Concept-based Explanations through Black-Box Distillation
Ricardo Moreira, Jacopo Bono, M\'ario Cardoso, Pedro Saleiro, M\'ario, A. T. Figueiredo, Pedro Bizarro

TL;DR
DiConStruct is a novel explanation method that provides causal, concept-based explanations for black-box models, maintaining high fidelity and efficiency without sacrificing predictive performance.
Contribution
It introduces a causal, concept-based explanation approach that distills black-box models into structural causal models, capturing causal relations and ensuring efficiency.
Findings
Higher fidelity in approximating black-box models compared to baselines
Provides causal relations between concepts in explanations
Maintains prediction performance while explaining models
Abstract
Model interpretability plays a central role in human-AI decision-making systems. Ideally, explanations should be expressed using human-interpretable semantic concepts. Moreover, the causal relations between these concepts should be captured by the explainer to allow for reasoning about the explanations. Lastly, explanation methods should be efficient and not compromise the performance of the predictive task. Despite the rapid advances in AI explainability in recent years, as far as we know to date, no method fulfills these three properties. Indeed, mainstream methods for local concept explainability do not produce causal explanations and incur a trade-off between explainability and prediction performance. We present DiConStruct, an explanation method that is both concept-based and causal, with the goal of creating more interpretable local explanations in the form of structural causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Advanced Neural Network Applications
