Extracting Interpretable Concept-Based Decision Trees from CNNs

Conner Chyung; Michael Tsang; Yan Liu

arXiv:1906.04664·cs.LG·June 18, 2019·1 cites

Extracting Interpretable Concept-Based Decision Trees from CNNs

Conner Chyung, Michael Tsang, Yan Liu

PDF

Open Access

TL;DR

This paper introduces a method to interpret CNNs by extracting concept-based decision trees from hidden layer activations, enabling better understanding of model reasoning with human-understandable concepts.

Contribution

The paper presents a novel approach to derive interpretable decision trees from CNNs that reveal concept importance and interactions, enhancing model transparency.

Findings

01

Decision trees accurately represent CNN classifications at low depths.

02

The method enables human-in-the-loop understanding of CNN concepts.

03

Extracted trees highlight concept importance and interactions.

Abstract

In an attempt to gather a deeper understanding of how convolutional neural networks (CNNs) reason about human-understandable concepts, we present a method to infer labeled concept data from hidden layer activations and interpret the concepts through a shallow decision tree. The decision tree can provide information about which concepts a model deems important, as well as provide an understanding of how the concepts interact with each other. Experiments demonstrate that the extracted decision tree is capable of accurately representing the original CNN's classifications at low tree depths, thus encouraging human-in-the-loop understanding of discriminative concepts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification