Provably Robust Model-Centric Explanations for Critical Decision-Making

Cecilia G. Morales; Nicholas Gisolfi; Robert Edman; James K. Miller,; Artur Dubrawski

arXiv:2110.13937·cs.LG·October 28, 2021

Provably Robust Model-Centric Explanations for Critical Decision-Making

Cecilia G. Morales, Nicholas Gisolfi, Robert Edman, James K. Miller,, Artur Dubrawski

PDF

Open Access

TL;DR

This paper advocates for a model-centric, SAT-based approach to generate robust explanations for AI models, providing more reliable insights than traditional data-centric methods like LIME and SHAP, especially in critical decision-making scenarios.

Contribution

It introduces a formal SAT-based framework for model-centric explanations, highlighting its advantages over existing data-centric explanation tools in terms of robustness and practical utility.

Findings

01

Model-centric explanations are more robust than data-centric methods.

02

SAT formalism enables actionable insights into AI model risks.

03

Data-centric explanations can be brittle and less reliable.

Abstract

We recommend using a model-centric, Boolean Satisfiability (SAT) formalism to obtain useful explanations of trained model behavior, different and complementary to what can be gleaned from LIME and SHAP, popular data-centric explanation tools in Artificial Intelligence (AI). We compare and contrast these methods, and show that data-centric methods may yield brittle explanations of limited practical utility. The model-centric framework, however, can offer actionable insights into risks of using AI models in practice. For critical applications of AI, split-second decision making is best informed by robust explanations that are invariant to properties of data, the capability offered by model-centric frameworks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning

MethodsLocal Interpretable Model-Agnostic Explanations · Shapley Additive Explanations