Unboxing Tree Ensembles for interpretability: a hierarchical   visualization tool and a multivariate optimal re-built tree

Giulia Di Teodoro; Marta Monaci; Laura Palagi

arXiv:2302.07580·math.OC·January 19, 2024

Unboxing Tree Ensembles for interpretability: a hierarchical visualization tool and a multivariate optimal re-built tree

Giulia Di Teodoro, Marta Monaci, Laura Palagi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical visualization tool and a MILP-based method to create an interpretable, sparse, multivariate tree that approximates complex ensemble models, enhancing understanding of their decision processes.

Contribution

It develops a novel hierarchical visualization and an MILP formulation for building an interpretable surrogate tree that mimics ensemble models with feature importance considerations.

Findings

01

The surrogate tree effectively approximates ensemble predictions.

02

The visualization highlights feature importance and usage levels.

03

The MILP approach produces shallow, interpretable trees with high fidelity.

Abstract

The interpretability of models has become a crucial issue in Machine Learning because of algorithmic decisions' growing impact on real-world applications. Tree ensemble methods, such as Random Forests or XgBoost, are powerful learning tools for classification tasks. However, while combining multiple trees may provide higher prediction quality than a single one, it sacrifices the interpretability property resulting in "black-box" models. In light of this, we aim to develop an interpretable representation of a tree-ensemble model that can provide valuable insights into its behavior. First, given a target tree-ensemble model, we develop a hierarchical visualization tool based on a heatmap representation of the forest's feature use, considering the frequency of a feature and the level at which it is selected as an indicator of importance. Next, we propose a mixed-integer linear programming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gditeodoro/VITE
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

MethodsFeature Selection · Heatmap