Model Agnostic Local Explanations of Reject

Andr\'e Artelt; Roel Visser; Barbara Hammer

arXiv:2205.07623·cs.AI·May 17, 2022·1 cites

Model Agnostic Local Explanations of Reject

Andr\'e Artelt, Roel Visser, Barbara Hammer

PDF

Open Access 1 Repo

TL;DR

This paper introduces a model-agnostic approach to locally explain reject options in machine learning systems, using interpretable models and counterfactuals to clarify why samples are rejected, which is crucial for safety-critical applications.

Contribution

It presents a novel method for explaining reject decisions in machine learning, addressing the open problem of interpretability for reject options.

Findings

01

Enables local explanations for reject decisions

02

Uses interpretable models and counterfactuals for explanations

03

Applicable to various reject options in ML systems

Abstract

The application of machine learning based decision making systems in safety critical areas requires reliable high certainty predictions. Reject options are a common way of ensuring a sufficiently high certainty of predictions made by the system. While being able to reject uncertain samples is important, it is also of importance to be able to explain why a particular sample was rejected. However, explaining general reject options is still an open problem. We propose a model agnostic method for locally explaining arbitrary reject options by means of interpretable models and counterfactual explanations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andreartelt/localmodelagnosticexplanationreject
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Fault Detection and Control Systems · Bayesian Modeling and Causal Inference