Towards a Unified Framework for Uncertainty-aware Nonlinear Variable Selection with Theoretical Guarantees
Wenying Deng, Beau Coker, Rajarshi Mukherjee, Jeremiah Zhe Liu, Brent, A. Coull

TL;DR
This paper introduces a unified Bayesian framework for nonlinear variable selection that quantifies uncertainty and guarantees theoretical consistency across various machine learning models, including non-differentiable ones.
Contribution
It proposes a generalizable approach for uncertainty-aware variable importance measurement with theoretical guarantees, applicable to diverse models like trees, kernels, and neural networks.
Findings
Outperforms existing variable selection methods in healthcare datasets
Provides posterior distributions for variable importance, enabling uncertainty quantification
Guarantees posterior consistency and asymptotic properties through rigorous theorems
Abstract
We develop a simple and unified framework for nonlinear variable selection that incorporates uncertainty in the prediction function and is compatible with a wide range of machine learning models (e.g., tree ensembles, kernel methods, neural networks, etc). In particular, for a learned nonlinear model , we consider quantifying the importance of an input variable using the integrated partial derivative . We then (1) provide a principled approach for quantifying variable selection uncertainty by deriving its posterior distribution, and (2) show that the approach is generalizable even to non-differentiable models such as tree ensembles. Rigorous Bayesian nonparametric theorems are derived to guarantee the posterior consistency and asymptotic uncertainty of the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning in Healthcare · Statistical Methods and Inference · Machine Learning and Data Classification
