On the Trustworthiness of Tree Ensemble Explainability Methods
Angeline Yasodhara, Azin Asgarian, Diego Huang, Parinaz Sobhani

TL;DR
This paper evaluates the accuracy and stability of global feature importance methods for tree ensemble models, revealing their limitations in noisy and high-dimensional settings through comprehensive experiments.
Contribution
It provides the first systematic comparison of global feature importance methods' accuracy and stability, highlighting their limitations in real-world scenarios.
Findings
Global feature importance methods lack accuracy with noisy inputs.
These methods are unstable under increased input dimension and data noise.
Model perturbations due to different initializations affect explanation stability.
Abstract
The recent increase in the deployment of machine learning models in critical domains such as healthcare, criminal justice, and finance has highlighted the need for trustworthy methods that can explain these models to stakeholders. Feature importance methods (e.g. gain and SHAP) are among the most popular explainability methods used to address this need. For any explainability technique to be trustworthy and meaningful, it has to provide an explanation that is accurate and stable. Although the stability of local feature importance methods (explaining individual predictions) has been studied before, there is yet a knowledge gap about the stability of global features importance methods (explanations for the whole model). Additionally, there is no study that evaluates and compares the accuracy of global feature importance methods with respect to feature ordering. In this paper, we evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
