CEnt: An Entropy-based Model-agnostic Explainability Framework to   Contrast Classifiers' Decisions

Julia El Zini; Mohammad Mansour; Mariette Awad

arXiv:2301.07941·cs.LG·January 20, 2023·1 cites

CEnt: An Entropy-based Model-agnostic Explainability Framework to Contrast Classifiers' Decisions

Julia El Zini, Mohammad Mansour, Mariette Awad

PDF

Open Access 1 Repo

TL;DR

CEnt is a novel, model-agnostic, entropy-based framework that generates diverse, feasible counterfactual explanations for classifiers, addressing mutability constraints and extending to image and text data.

Contribution

It introduces CEnt, a non-gradient-based contrastive explanation method that approximates models with decision trees and uses entropy to generate diverse counterfactuals respecting mutability constraints.

Findings

01

Outperforms existing methods in proximity rates

02

Generates feasible counterfactuals respecting mutability constraints

03

Extends to image and text data for interpretability and vulnerability detection

Abstract

Current interpretability methods focus on explaining a particular model's decision through present input features. Such methods do not inform the user of the sufficient conditions that alter these decisions when they are not desirable. Contrastive explanations circumvent this problem by providing explanations of the form "If the feature $X > x$ , the output $Y$ would be different''. While different approaches are developed to find contrasts; these methods do not all deal with mutability and attainability constraints. In this work, we present a novel approach to locally contrast the prediction of any classifier. Our Contrastive Entropy-based explanation method, CEnt, approximates a model locally by a decision tree to compute entropy information of different feature splits. A graph, G, is then built where contrast nodes are found through a one-to-many shortest path search. Contrastive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mohamadmansourx/cent
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Adversarial Robustness in Machine Learning

MethodsCounterfactuals Explanations