CEnt: An Entropy-based Model-agnostic Explainability Framework to Contrast Classifiers' Decisions
Julia El Zini, Mohammad Mansour, Mariette Awad

TL;DR
CEnt is a novel, model-agnostic, entropy-based framework that generates diverse, feasible counterfactual explanations for classifiers, addressing mutability constraints and extending to image and text data.
Contribution
It introduces CEnt, a non-gradient-based contrastive explanation method that approximates models with decision trees and uses entropy to generate diverse counterfactuals respecting mutability constraints.
Findings
Outperforms existing methods in proximity rates
Generates feasible counterfactuals respecting mutability constraints
Extends to image and text data for interpretability and vulnerability detection
Abstract
Current interpretability methods focus on explaining a particular model's decision through present input features. Such methods do not inform the user of the sufficient conditions that alter these decisions when they are not desirable. Contrastive explanations circumvent this problem by providing explanations of the form "If the feature , the output would be different''. While different approaches are developed to find contrasts; these methods do not all deal with mutability and attainability constraints. In this work, we present a novel approach to locally contrast the prediction of any classifier. Our Contrastive Entropy-based explanation method, CEnt, approximates a model locally by a decision tree to compute entropy information of different feature splits. A graph, G, is then built where contrast nodes are found through a one-to-many shortest path search. Contrastive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Adversarial Robustness in Machine Learning
MethodsCounterfactuals Explanations
