Understanding complex predictive models with Ghost Variables

Pedro Delicado; Daniel Pe\~na

arXiv:1912.06407·stat.ML·February 17, 2020

Understanding complex predictive models with Ghost Variables

Pedro Delicado, Daniel Pe\~na

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method to assess the relevance of variables in complex predictive models, including neural networks, by comparing predictions with ghost variables and analyzing joint effects through eigenvalues.

Contribution

It proposes a new relevance measure for explanatory variables that captures both individual and joint effects in complex models, extending beyond traditional significance measures.

Findings

01

Relevance measures align with standard significance in simple models.

02

Method reveals joint effects in neural networks not accessible by traditional methods.

03

Illustrated with simulated data and a large real dataset.

Abstract

We propose a procedure for assigning a relevance measure to each explanatory variable in a complex predictive model. We assume that we have a training set to fit the model and a test set to check the out of sample performance. First, the individual relevance of each variable is computed by comparing the predictions in the test set, given by the model that includes all the variables with those of another model in which the variable of interest is substituted by its ghost variable, defined as the prediction of this variable by using the rest of explanatory variables. Second, we check the joint effects among the variables by using the eigenvalues of a relevance matrix that is the covariance matrix of the vectors of individual effects. It is shown that in simple models, as linear or additive models, the proposed measures are related to standard measures of significance of the variables and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pedrodelicado/GhostVariables
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsTest