McXai: Local model-agnostic explanation as two games
Yiran Huang, Nicole Schaal, Michael Hefenbrock, Yexu Zhou, Till, Riedel, Likun Fang, Michael Beigl

TL;DR
McXai introduces a reinforcement learning-based method using Monte Carlo tree search to generate interpretable, feature-dependent explanations for black-box classifiers, improving informativeness and robustness over classical methods.
Contribution
This work presents a novel approach that models explanation generation as two games, providing more informative and human-friendly explanations compared to existing methods like LIME and SHAP.
Findings
Features identified are more informative than classical methods.
The approach can identify misleading features.
Guides towards improved model robustness.
Abstract
To this day, a variety of approaches for providing local interpretability of black-box machine learning models have been introduced. Unfortunately, all of these methods suffer from one or more of the following deficiencies: They are either difficult to understand themselves, they work on a per-feature basis and ignore the dependencies between features and/or they only focus on those features asserting the decision made by the model. To address these points, this work introduces a reinforcement learning-based approach called Monte Carlo tree search for eXplainable Artificial Intelligent (McXai) to explain the decisions of any black-box classification model (classifier). Our method leverages Monte Carlo tree search and models the process of generating explanations as two games. In one game, the reward is maximized by finding feature sets that support the decision of the classifier, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Stock Market Forecasting Methods
MethodsLocal Interpretable Model-Agnostic Explanations · Shapley Additive Explanations
