MAIRE -- A Model-Agnostic Interpretable Rule Extraction Procedure for Explaining Classifiers

Rajat Sharma; Nikhil Reddy; Vidhya Kamakshi; Narayanan C Krishnan; Shweta Jain

arXiv:2011.01506·cs.AI·November 4, 2020

TL;DR

MAIRE is a versatile, model-agnostic method that extracts human-interpretable rules in the form of hyper-cuboids to explain classifier decisions across various data types and domains.

Contribution

The paper introduces a novel gradient-based optimization framework for extracting high-coverage, high-precision, human-interpretable rules that explain any classifier's output.

Findings

01

Effective rule extraction demonstrated on diverse datasets.

02

Theoretical analysis confirms approximation quality.

03

Heuristics improve interpretability of explanations.

Abstract

The paper introduces a novel framework for extracting model-agnostic, human-interpretable rules to explain a classifier's output. A human-interpretable rule is defined as an axis-aligned hyper-cuboid containing the instance whose classification decision is to be explained. The proposed procedure finds the largest (high coverage) axis-aligned hyper-cuboid such that a high percentage of the instances in the hyper-cuboid have the same class label as the instance being explained (high precision). Novel approximations to the coverage and precision measures are defined in terms of the parameters of the hyper-cuboid and are maximized using gradient-based optimizers. The quality of the approximations is rigorously analyzed theoretically and experimentally. Heuristics for simplifying the generated explanations for achieving better interpretability and a greedy…
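To make the two measures in the abstract concrete, the following is a minimal sketch of how exact coverage and precision could be computed for a candidate hyper-cuboid. It is an illustration under stated assumptions, not the authors' implementation: coverage is taken here as the fraction of a reference dataset that falls inside the cuboid, and the names `in_cuboid`, `coverage_and_precision`, `toy_predict`, and the toy data are all hypothetical. The paper optimizes differentiable approximations of these quantities; the sketch computes only the exact, non-differentiable versions.

```python
from typing import Callable, Sequence, Tuple

Point = Sequence[float]


def in_cuboid(x: Point, lower: Point, upper: Point) -> bool:
    """Return True if x lies inside the axis-aligned box [lower, upper]."""
    return all(lo <= xi <= hi for xi, lo, hi in zip(x, lower, upper))


def coverage_and_precision(
    data: Sequence[Point],
    predict: Callable[[Point], int],
    x0: Point,
    lower: Point,
    upper: Point,
) -> Tuple[float, float]:
    """Exact coverage and precision of a hyper-cuboid rule around x0.

    Assumption: coverage = fraction of `data` inside the cuboid;
    precision = fraction of those points the classifier labels like x0.
    """
    if not in_cuboid(x0, lower, upper):
        raise ValueError("the explained instance must lie inside the cuboid")
    target = predict(x0)
    inside = [x for x in data if in_cuboid(x, lower, upper)]
    coverage = len(inside) / len(data)
    if not inside:
        return coverage, 0.0
    precision = sum(predict(x) == target for x in inside) / len(inside)
    return coverage, precision


# Hypothetical black-box classifier and toy data, for illustration only.
def toy_predict(x: Point) -> int:
    return int(x[0] + x[1] > 1.0)


toy_data = [(0.1, 0.1), (0.2, 0.7), (0.6, 0.6), (0.9, 0.9), (0.8, 0.3), (0.7, 0.9)]
x_explained = (0.9, 0.9)

# A tight cuboid: covers half the data, and every covered point shares the label.
tight = coverage_and_precision(toy_data, toy_predict, x_explained, (0.5, 0.5), (1.0, 1.0))

# A cuboid spanning the whole unit square: full coverage, lower precision.
wide = coverage_and_precision(toy_data, toy_predict, x_explained, (0.0, 0.0), (1.0, 1.0))
```

In this toy setting the tight cuboid trades coverage for precision and the wide one does the opposite, which is the tension the procedure described in the abstract resolves by seeking the largest cuboid that still keeps precision high.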

Taxonomy

Methods · Interpretability