CLIMAX: An exploration of Classifier-Based Contrastive Explanations

Praharsh Nanavati; Ranjitha Prasad

arXiv:2307.00680·cs.LG·July 4, 2023·1 cites

CLIMAX: An exploration of Classifier-Based Contrastive Explanations

Praharsh Nanavati, Ranjitha Prasad

PDF

Open Access 1 Repo

TL;DR

CLIMAX introduces a novel contrastive, class-aware explanation method for black-box models, improving consistency and reliability of post-hoc interpretability across textual and image data.

Contribution

It proposes a new model-agnostic, contrastive explanation technique using local classifiers and label-aware data generation, enhancing explanation fidelity and class distinction.

Findings

01

Outperforms LIME, BayLIME, and SLIME in consistency

02

Provides contrastive explanations for textual and image datasets

03

Ensures class-balanced surrogate data for better fidelity

Abstract

Explainable AI is an evolving area that deals with understanding the decision making of machine learning models so that these models are more transparent, accountable, and understandable for humans. In particular, post-hoc model-agnostic interpretable AI techniques explain the decisions of a black-box ML model for a single instance locally, without the knowledge of the intrinsic nature of the ML model. Despite their simplicity and capability in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations. Moreover, in the context of black-box classifiers, existing approaches justify the predicted class, but these methods do not ensure that the explanation scores strongly differ as compared to those of another class. In this work we propose a novel post-hoc model agnostic XAI technique that provides contrastive explanations justifying the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

niftynans/climax
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

Methodsfail · Local Interpretable Model-Agnostic Explanations