Explaining, Fast and Slow: Abstraction and Refinement of Provable Explanations

Shahaf Bassan; Yizhak Yisrael Elboher; Tobias Ladner; Matthias Althoff; Guy Katz

arXiv:2506.08505·cs.LG·June 11, 2025

Explaining, Fast and Slow: Abstraction and Refinement of Provable Explanations

Shahaf Bassan, Yizhak Yisrael Elboher, Tobias Ladner, Matthias Althoff, Guy Katz

PDF

Open Access 1 Video

TL;DR

This paper introduces an abstraction-refinement method that efficiently computes provably sufficient explanations for neural network predictions, balancing formal guarantees with scalability and interpretability.

Contribution

It proposes a novel abstraction-refinement technique that reduces neural network size for faster provable explanations, with iterative refinement for accuracy.

Findings

01

Significantly improves explanation computation speed

02

Provides fine-grained interpretation across abstraction levels

03

Maintains formal guarantees of explanation sufficiency

Abstract

Despite significant advancements in post-hoc explainability techniques for neural networks, many current methods rely on heuristics and do not provide formally provable guarantees over the explanations provided. Recent work has shown that it is possible to obtain explanations with formal guarantees by identifying subsets of input features that are sufficient to determine that predictions remain unchanged using neural network verification techniques. Despite the appeal of these explanations, their computation faces significant scalability challenges. In this work, we address this gap by proposing a novel abstraction-refinement technique for efficiently computing provably sufficient explanations of neural network predictions. Our method abstracts the original large neural network by constructing a substantially reduced network, where a sufficient explanation of the reduced network is also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Explaining, Fast and Slow: Abstraction and Refinement of Provable Explanations· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science