Local MDI+: Local Feature Importances for Tree-Based Models
Zhongyuan Liang, Zachary T. Rewolinski, Abhineet Agarwal, Tiffany M. Tang, Bin Yu

TL;DR
This paper introduces LMDI+, a local feature importance method for tree-based models that improves interpretability and stability over existing approaches like LIME and TreeSHAP, especially in heterogeneous data settings.
Contribution
LMDI+ extends the global MDI+ framework to provide accurate, stable, and instance-specific feature importance explanations for tree ensembles.
Findings
LMDI+ outperforms LIME and TreeSHAP in identifying relevant features.
LMDI+ achieves a 10% average improvement in downstream task performance.
LMDI+ offers stable and consistent feature importance rankings across models.
Abstract
Tree-based ensembles such as random forests remain the go-to for tabular data over deep learning models due to their prediction performance and computational efficiency. These advantages have led to their widespread deployment in high-stakes domains, where interpretability is essential for ensuring trustworthy predictions. This has motivated the development of popular local (i.e. sample-specific) feature importance (LFI) methods such as LIME and TreeSHAP. However, these approaches rely on approximations that ignore the model's internal structure and instead depend on potentially unstable perturbations. These issues are addressed in the global setting by MDI+, a feature importance method which exploits an equivalence between decision trees and linear models on a transformed node basis. However, the global MDI+ scores are not able to explain predictions when faced with heterogeneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
MethodsLocal Interpretable Model-Agnostic Explanations · Counterfactuals Explanations
