Logic-Based Explainability in Machine Learning

Joao Marques-Silva

arXiv:2211.00541·cs.AI·January 31, 2023·5 cites

Logic-Based Explainability in Machine Learning

Joao Marques-Silva

PDF

Open Access

TL;DR

This paper reviews research on formal, logic-based explanations for machine learning models, emphasizing the importance of rigorous, interpretable explanations for high-stakes applications.

Contribution

It provides an overview of the current state of formal explanation methods in ML, including definitions, complexity, logical encodings, and interpretability strategies.

Findings

01

Highlights the limitations of non-formal explanations

02

Summarizes logical approaches for rigorous explanations

03

Discusses challenges in making explanations human-interpretable

Abstract

The last decade witnessed an ever-increasing stream of successes in Machine Learning (ML). These successes offer clear evidence that ML is bound to become pervasive in a wide range of practical uses, including many that directly affect humans. Unfortunately, the operation of the most successful ML models is incomprehensible for human decision makers. As a result, the use of ML models, especially in high-risk and safety-critical settings is not without concern. In recent years, there have been efforts on devising approaches for explaining ML models. Most of these efforts have focused on so-called model-agnostic approaches. However, all model-agnostic and related approaches offer no guarantees of rigor, hence being referred to as non-formal. For example, such non-formal explanations can be consistent with different predictions, which renders them useless in practice. This paper overviews…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification