On the Trustworthiness of Tree Ensemble Explainability Methods

Angeline Yasodhara; Azin Asgarian; Diego Huang; Parinaz Sobhani

arXiv:2110.00086·cs.LG·October 4, 2021

On the Trustworthiness of Tree Ensemble Explainability Methods

Angeline Yasodhara, Azin Asgarian, Diego Huang, Parinaz Sobhani

PDF

TL;DR

This paper evaluates the accuracy and stability of global feature importance methods for tree ensemble models, revealing their limitations in noisy and high-dimensional settings through comprehensive experiments.

Contribution

It provides the first systematic comparison of global feature importance methods' accuracy and stability, highlighting their limitations in real-world scenarios.

Findings

01

Global feature importance methods lack accuracy with noisy inputs.

02

These methods are unstable under increased input dimension and data noise.

03

Model perturbations due to different initializations affect explanation stability.

Abstract

The recent increase in the deployment of machine learning models in critical domains such as healthcare, criminal justice, and finance has highlighted the need for trustworthy methods that can explain these models to stakeholders. Feature importance methods (e.g. gain and SHAP) are among the most popular explainability methods used to address this need. For any explainability technique to be trustworthy and meaningful, it has to provide an explanation that is accurate and stable. Although the stability of local feature importance methods (explaining individual predictions) has been studied before, there is yet a knowledge gap about the stability of global features importance methods (explanations for the whole model). Additionally, there is no study that evaluates and compares the accuracy of global feature importance methods with respect to feature ordering. In this paper, we evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.