A Symbolic Approach to Explaining Bayesian Network Classifiers

Andy Shih; Arthur Choi; Adnan Darwiche

arXiv:1805.03364·cs.AI·May 11, 2018

A Symbolic Approach to Explaining Bayesian Network Classifiers

Andy Shih, Arthur Choi, Adnan Darwiche

PDF

TL;DR

This paper introduces a symbolic method for explaining Bayesian network classifiers by compiling them into decision functions, offering minimal feature-based explanations for classifications.

Contribution

It presents algorithms for generating two types of explanations and demonstrates their application to Naive and Latent-Tree Bayesian classifiers using ODDs.

Findings

01

Effective explanations for classifier decisions

02

Compilation of classifiers into ordered decision diagrams

03

Case studies validating the approach

Abstract

We propose an approach for explaining Bayesian network classifiers, which is based on compiling such classifiers into decision functions that have a tractable and symbolic form. We introduce two types of explanations for why a classifier may have classified an instance positively or negatively and suggest algorithms for computing these explanations. The first type of explanation identifies a minimal set of the currently active features that is responsible for the current classification, while the second type of explanation identifies a minimal set of features whose current state (active or not) is sufficient for the classification. We consider in particular the compilation of Naive and Latent-Tree Bayesian network classifiers into Ordered Decision Diagrams (ODDs), providing a context for evaluating our proposal using case studies and experiments based on classifiers from the literature.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.