CLIMAX: An exploration of Classifier-Based Contrastive Explanations
Praharsh Nanavati, Ranjitha Prasad

TL;DR
CLIMAX introduces a novel contrastive, class-aware explanation method for black-box models, improving consistency and reliability of post-hoc interpretability across textual and image data.
Contribution
It proposes a new model-agnostic, contrastive explanation technique using local classifiers and label-aware data generation, enhancing explanation fidelity and class distinction.
Findings
Outperforms LIME, BayLIME, and SLIME in consistency
Provides contrastive explanations for textual and image datasets
Ensures class-balanced surrogate data for better fidelity
Abstract
Explainable AI is an evolving area that deals with understanding the decision making of machine learning models so that these models are more transparent, accountable, and understandable for humans. In particular, post-hoc model-agnostic interpretable AI techniques explain the decisions of a black-box ML model for a single instance locally, without the knowledge of the intrinsic nature of the ML model. Despite their simplicity and capability in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations. Moreover, in the context of black-box classifiers, existing approaches justify the predicted class, but these methods do not ensure that the explanation scores strongly differ as compared to those of another class. In this work we propose a novel post-hoc model agnostic XAI technique that provides contrastive explanations justifying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
Methodsfail · Local Interpretable Model-Agnostic Explanations
