Interpreting random forest classification models using a feature   contribution method

Anna Palczewska; Jan Palczewski; Richard Marchese Robinson and; Daniel Neagu

arXiv:1312.1121·cs.LG·December 5, 2013·2 cites

Interpreting random forest classification models using a feature contribution method

Anna Palczewska, Jan Palczewski, Richard Marchese Robinson and, Daniel Neagu

PDF

Open Access

TL;DR

This paper introduces a method to interpret random forest models by computing feature contributions, enabling understanding of variable influence on individual predictions and assessing model reliability.

Contribution

It presents a novel approach for calculating feature contributions in random forests, enhancing interpretability and model assessment for black box classifiers.

Findings

01

Effective identification of significant variables

02

Discovery of class-specific contribution patterns

03

Robustness demonstrated across multiple models

Abstract

Model interpretation is one of the key aspects of the model evaluation process. The explanation of the relationship between model variables and outputs is relatively easy for statistical models, such as linear regressions, thanks to the availability of model parameters and their statistical significance. For "black box" models, such as random forest, this information is hidden inside the model structure. This work presents an approach for computing feature contributions for random forest classification models. It allows for the determination of the influence of each variable on the model prediction for an individual instance. By analysing feature contributions for a training dataset, the most significant variables can be determined and their typical contribution towards predictions made for individual classes, i.e., class-specific feature contribution "patterns", are discovered. These…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Gaussian Processes and Bayesian Inference