Unboxing Tree Ensembles for interpretability: a hierarchical visualization tool and a multivariate optimal re-built tree
Giulia Di Teodoro, Marta Monaci, Laura Palagi

TL;DR
This paper introduces a hierarchical visualization tool and a MILP-based method to create an interpretable, sparse, multivariate tree that approximates complex ensemble models, enhancing understanding of their decision processes.
Contribution
It develops a novel hierarchical visualization and an MILP formulation for building an interpretable surrogate tree that mimics ensemble models with feature importance considerations.
Findings
The surrogate tree effectively approximates ensemble predictions.
The visualization highlights feature importance and usage levels.
The MILP approach produces shallow, interpretable trees with high fidelity.
Abstract
The interpretability of models has become a crucial issue in Machine Learning because of algorithmic decisions' growing impact on real-world applications. Tree ensemble methods, such as Random Forests or XgBoost, are powerful learning tools for classification tasks. However, while combining multiple trees may provide higher prediction quality than a single one, it sacrifices the interpretability property resulting in "black-box" models. In light of this, we aim to develop an interpretable representation of a tree-ensemble model that can provide valuable insights into its behavior. First, given a target tree-ensemble model, we develop a hierarchical visualization tool based on a heatmap representation of the forest's feature use, considering the frequency of a feature and the level at which it is selected as an indicator of importance. Next, we propose a mixed-integer linear programming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
MethodsFeature Selection · Heatmap
