Consistent Individualized Feature Attribution for Tree Ensembles

Scott M. Lundberg; Gabriel G. Erion; and Su-In Lee

arXiv:1802.03888·cs.LG·March 8, 2019·552 cites

Consistent Individualized Feature Attribution for Tree Ensembles

Scott M. Lundberg, Gabriel G. Erion, and Su-In Lee

PDF

Open Access 5 Repos

TL;DR

This paper introduces a fast, exact method for computing consistent, individualized feature attributions in tree ensemble models using SHAP values, improving interpretability and clustering of features.

Contribution

The paper develops a novel, efficient algorithm for exact SHAP value computation in tree ensembles, ensuring consistency and extending to interaction effects.

Findings

01

Exact SHAP values improve interpretability.

02

Enhanced clustering based on feature attributions.

03

Better alignment with human intuition.

Abstract

Interpreting predictions from tree ensemble methods such as gradient boosting machines and random forests is important, yet feature attribution for trees is often heuristic and not individualized for each prediction. Here we show that popular feature attribution methods are inconsistent, meaning they can lower a feature's assigned importance when the true impact of that feature actually increases. This is a fundamental problem that casts doubt on any comparison between features. To address it we turn to recent applications of game theory and develop fast exact tree solutions for SHAP (SHapley Additive exPlanation) values, which are the unique consistent and locally accurate attribution values. We then extend SHAP values to interaction effects and define SHAP interaction values. We propose a rich visualization of individualized feature attributions that improves over classic attribution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsForest ecology and management · Explainable Artificial Intelligence (XAI) · Data Analysis with R