MAIRE -- A Model-Agnostic Interpretable Rule Extraction Procedure for Explaining Classifiers
Rajat Sharma, Nikhil Reddy, Vidhya Kamakshi, Narayanan C Krishnan,, Shweta Jain

TL;DR
MAIRE is a versatile, model-agnostic method that extracts human-interpretable rules in the form of hyper-cuboids to explain classifier decisions across various data types and domains.
Contribution
The paper introduces a novel gradient-based optimization framework for extracting high-coverage, high-precision, human-interpretable rules that explain any classifier's output.
Findings
Effective rule extraction demonstrated on diverse datasets.
Theoretical analysis confirms approximation quality.
Heuristics improve interpretability of explanations.
Abstract
The paper introduces a novel framework for extracting model-agnostic human interpretable rules to explain a classifier's output. The human interpretable rule is defined as an axis-aligned hyper-cuboid containing the instance for which the classification decision has to be explained. The proposed procedure finds the largest (high \textit{coverage}) axis-aligned hyper-cuboid such that a high percentage of the instances in the hyper-cuboid have the same class label as the instance being explained (high \textit{precision}). Novel approximations to the coverage and precision measures in terms of the parameters of the hyper-cuboid are defined. They are maximized using gradient-based optimizers. The quality of the approximations is rigorously analyzed theoretically and experimentally. Heuristics for simplifying the generated explanations for achieving better interpretability and a greedy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
