Knowledge Distillation Decision Tree for Unravelling Black-box Machine Learning Models
Xuetao Lu, J. Jack Lee

TL;DR
This paper introduces a novel method called KDDT that distills knowledge from black-box models into decision trees, enhancing interpretability while maintaining accuracy and stability, supported by theoretical foundations and empirical validation.
Contribution
The paper proposes the KDDT method for interpreting black-box models through stable and predictive decision trees, including a hybrid version with an efficient construction algorithm.
Findings
KDDT achieves structural stability under mild assumptions.
Hybrid KDDT balances simplicity and predictivity effectively.
Simulation and real-data studies confirm the method's accuracy and reliability.
Abstract
Machine learning models, particularly the black-box models, are widely favored for their outstanding predictive capabilities. However, they often face scrutiny and criticism due to the lack of interpretability. Paradoxically, their strong predictive capabilities may indicate a deep understanding of the underlying data, implying significant potential for interpretation. Leveraging the emerging concept of knowledge distillation, we introduce the method of knowledge distillation decision tree (KDDT). This method enables the distillation of knowledge about the data from a black-box model into a decision tree, thereby facilitating the interpretation of the black-box model. Essential attributes for a good interpretable model include simplicity, stability, and predictivity. The primary challenge of constructing interpretable tree lies in ensuring structural stability under the randomness of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference
MethodsKnowledge Distillation
