On The Reasons Behind Decisions

Adnan Darwiche; Auguste Hirth

arXiv:2002.09284·cs.AI·April 21, 2020·63 cites

On The Reasons Behind Decisions

Adnan Darwiche, Auguste Hirth

PDF

Open Access

TL;DR

This paper develops a theoretical framework for understanding the reasons behind decisions made by Boolean classifiers, introducing notions like sufficient and necessary reasons, and providing algorithms for their computation.

Contribution

It introduces a formal theory for explaining Boolean classifier decisions, including new notions and efficient algorithms, with practical case study applications.

Findings

01

Defined notions of sufficient, necessary, and complete reasons behind decisions

02

Developed algorithms for computing decision reasons efficiently

03

Applied the framework to a real-world case study

Abstract

Recent work has shown that some common machine learning classifiers can be compiled into Boolean circuits that have the same input-output behavior. We present a theory for unveiling the reasons behind the decisions made by Boolean classifiers and study some of its theoretical and practical implications. We define notions such as sufficient, necessary and complete reasons behind decisions, in addition to classifier and decision bias. We show how these notions can be used to evaluate counterfactual statements such as "a decision will stick even if ... because ... ." We present efficient algorithms for computing these notions, which are based on new advances on tractable Boolean circuits, and illustrate them using a case study.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Machine Learning and Algorithms