An Interpretable Probabilistic Approach for Demystifying Black-box   Predictive Models

Catarina Moreira; Yu-Liang Chou; Mythreyi Velmurugan; Chun; Ouyang; Renuka Sindhgatta; Peter Bruza

arXiv:2007.10668·cs.AI·July 22, 2020·1 cites

An Interpretable Probabilistic Approach for Demystifying Black-box Predictive Models

Catarina Moreira, Yu-Liang Chou, Mythreyi Velmurugan, Chun, Ouyang, Renuka Sindhgatta, Peter Bruza

PDF

Open Access

TL;DR

This paper introduces a Bayesian network-based method for post hoc interpretation of black-box models, providing insights into feature contributions, focused explanations, and confidence assessment, enhancing interpretability of complex machine learning models.

Contribution

The paper presents a novel Bayesian network framework that offers detailed feature contribution explanations, focused interpretations via Markov blankets, and confidence rules for black-box models.

Findings

01

Effective extraction of Bayesian networks approximating black-box models

02

Provides explanations of feature importance and contribution

03

Enables confidence assessment through derived rules

Abstract

The use of sophisticated machine learning models for critical decision making is faced with a challenge that these models are often applied as a "black-box". This has led to an increased interest in interpretable machine learning, where post hoc interpretation presents a useful mechanism for generating interpretations of complex learning models. In this paper, we propose a novel approach underpinned by an extended framework of Bayesian networks for generating post hoc interpretations of a black-box predictive model. The framework supports extracting a Bayesian network as an approximation of the black-box model for a specific prediction. Compared to the existing post hoc interpretation methods, the contribution of our approach is three-fold. Firstly, the extracted Bayesian network, as a probabilistic graphical model, can provide interpretations about not only what input features but also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification