Extracting Interpretable Models from Tree Ensembles: Computational and Statistical Perspectives
Brian Liu, Rahul Mazumder, Peter Radchenko

TL;DR
This paper introduces a novel estimator for extracting interpretable decision rules from tree ensembles, balancing accuracy with interpretability and providing theoretical guarantees.
Contribution
It proposes a flexible, efficient method with algorithms and error bounds for extracting compact, interpretable models from complex tree ensembles.
Findings
The estimator achieves competitive predictive accuracy compared to the oracle.
It outperforms existing rule extraction algorithms in experiments.
Theoretical bounds show predictive performance comparable to an oracle.
Abstract
Tree ensembles are non-parametric methods widely recognized for their accuracy and ability to capture complex interactions. While these models excel at prediction, they are difficult to interpret and may fail to uncover useful relationships in the data. We propose an estimator to extract compact sets of decision rules from tree ensembles. The extracted models are accurate and can be manually examined to reveal relationships between the predictors and the response. A key novelty of our estimator is the flexibility to jointly control the number of rules extracted and the interaction depth of each rule, which improves accuracy. We develop a tailored exact algorithm to efficiently solve optimization problems underlying our estimator and an approximate algorithm for computing regularization paths, sequences of solutions that correspond to varying model sizes. We also establish novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
