TreeView: Peeking into Deep Neural Networks Via Feature-Space Partitioning
Jayaraman J. Thiagarajan, Bhavya Kailkhura, Prasanna Sattigeri and, Karthikeyan Natesan Ramamurthy

TL;DR
TreeView offers a novel method to interpret deep neural networks by hierarchically partitioning feature space, providing insights into prediction processes without sacrificing model accuracy.
Contribution
The paper introduces TreeView, a new approach for interpreting deep models through feature-space partitioning that maintains high accuracy while enhancing interpretability.
Findings
TreeView effectively visualizes the decision process of neural networks.
It reveals how models reject unlikely classes iteratively.
Provides clearer understanding of model predictions.
Abstract
With the advent of highly predictive but opaque deep learning models, it has become more important than ever to understand and explain the predictions of such models. Existing approaches define interpretability as the inverse of complexity and achieve interpretability at the cost of accuracy. This introduces a risk of producing interpretable but misleading explanations. As humans, we are prone to engage in this kind of behavior \cite{mythos}. In this paper, we take a step in the direction of tackling the problem of interpretability without compromising the model accuracy. We propose to build a Treeview representation of the complex model via hierarchical partitioning of the feature space, which reveals the iterative rejection of unlikely class labels until the correct association is predicted.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Advanced Neural Network Applications
MethodsInterpretability
