Efficient Decompositional Rule Extraction for Deep Neural Networks
Mateo Espinosa Zarlenga, Zohreh Shams, Mateja Jamnik

TL;DR
ECLAIRE is a new polynomial-time rule extraction algorithm that scales efficiently to large DNNs and datasets, providing more accurate and understandable rules than existing methods, with open-source tools available.
Contribution
The paper introduces ECLAIRE, a scalable, polynomial-time decompositional rule extraction method for deep neural networks, overcoming previous limitations of intractability and single-layer restrictions.
Findings
ECLAIRE outperforms state-of-the-art rule extraction methods in accuracy.
ECLAIRE requires significantly less computational resources.
ECLAIRE effectively handles large DNN architectures and datasets.
Abstract
In recent years, there has been significant work on increasing both interpretability and debuggability of a Deep Neural Network (DNN) by extracting a rule-based model that approximates its decision boundary. Nevertheless, current DNN rule extraction methods that consider a DNN's latent space when extracting rules, known as decompositional algorithms, are either restricted to single-layer DNNs or intractable as the size of the DNN or data grows. In this paper, we address these limitations by introducing ECLAIRE, a novel polynomial-time rule extraction algorithm capable of scaling to both large DNN architectures and large training datasets. We evaluate ECLAIRE on a wide variety of tasks, ranging from breast cancer prognosis to particle detection, and show that it consistently extracts more accurate and comprehensible rule sets than the current state-of-the-art methods while using orders…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference
