Interpreting Shared Deep Learning Models via Explicable Boundary Trees

Huijun Wu; Chen Wang; Jie Yin; Kai Lu; Liming Zhu

arXiv:1709.03730·cs.LG·September 14, 2017·5 cites

Interpreting Shared Deep Learning Models via Explicable Boundary Trees

Huijun Wu, Chen Wang, Jie Yin, Kai Lu, Liming Zhu

PDF

Open Access

TL;DR

This paper introduces a method to interpret complex deep learning models by constructing a boundary tree from a small, privacy-preserving subset of training data, enhancing transparency and trust in model sharing.

Contribution

The paper proposes a novel approach to interpret deep models using boundary trees built from limited training data, improving understanding without sharing full datasets.

Findings

01

Boundary trees approximate complex models with high fidelity.

02

Traversing the tree improves user understanding of model decisions.

03

Method enhances trust in shared models under privacy constraints.

Abstract

Despite outperforming the human in many tasks, deep neural network models are also criticized for the lack of transparency and interpretability in decision making. The opaqueness results in uncertainty and low confidence when deploying such a model in model sharing scenarios, when the model is developed by a third party. For a supervised machine learning model, sharing training process including training data provides an effective way to gain trust and to better understand model predictions. However, it is not always possible to share all training data due to privacy and policy constraints. In this paper, we propose a method to disclose a small set of training data that is just sufficient for users to get the insight of a complicated model. The method constructs a boundary tree using selected training data and the tree is able to approximate the complicated model with high fidelity. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Data Stream Mining Techniques

MethodsInterpretability