Towards a Unified Framework for Uncertainty-aware Nonlinear Variable   Selection with Theoretical Guarantees

Wenying Deng; Beau Coker; Rajarshi Mukherjee; Jeremiah Zhe Liu; Brent; A. Coull

arXiv:2204.07293·stat.ML·May 30, 2022·1 cites

Towards a Unified Framework for Uncertainty-aware Nonlinear Variable Selection with Theoretical Guarantees

Wenying Deng, Beau Coker, Rajarshi Mukherjee, Jeremiah Zhe Liu, Brent, A. Coull

PDF

Open Access 1 Video

TL;DR

This paper introduces a unified Bayesian framework for nonlinear variable selection that quantifies uncertainty and guarantees theoretical consistency across various machine learning models, including non-differentiable ones.

Contribution

It proposes a generalizable approach for uncertainty-aware variable importance measurement with theoretical guarantees, applicable to diverse models like trees, kernels, and neural networks.

Findings

01

Outperforms existing variable selection methods in healthcare datasets

02

Provides posterior distributions for variable importance, enabling uncertainty quantification

03

Guarantees posterior consistency and asymptotic properties through rigorous theorems

Abstract

We develop a simple and unified framework for nonlinear variable selection that incorporates uncertainty in the prediction function and is compatible with a wide range of machine learning models (e.g., tree ensembles, kernel methods, neural networks, etc). In particular, for a learned nonlinear model $f (x)$ , we consider quantifying the importance of an input variable $x^{j}$ using the integrated partial derivative $Ψ_{j} = ∥ \frac{\partial}{\partial x ^{j}} f (x) ∥_{P_{X}}^{2}$ . We then (1) provide a principled approach for quantifying variable selection uncertainty by deriving its posterior distribution, and (2) show that the approach is generalizable even to non-differentiable models such as tree ensembles. Rigorous Bayesian nonparametric theorems are derived to guarantee the posterior consistency and asymptotic uncertainty of the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards a Unified Framework for Uncertainty-aware Nonlinear Variable Selection with Theoretical Guarantees· slideslive

Taxonomy

TopicsMachine Learning in Healthcare · Statistical Methods and Inference · Machine Learning and Data Classification