Variable Selection via Thompson Sampling
Yi Liu, Veronika Rockova

TL;DR
This paper introduces Thompson Variable Selection (TVS), a Bayesian-inspired stochastic method for subset selection that is flexible, robust, and effective for high-dimensional, non-parametric machine learning models, with strong empirical results.
Contribution
The paper proposes TVS, a novel stochastic optimization framework for variable selection that extends Bayesian methods to non-parametric models and large datasets, applicable in offline and online settings.
Findings
Strong empirical performance on simulated data
Robustness due to stochastic approach, less prone to local convergence
Regret bounds for bandit-based variable selection
Abstract
Thompson sampling is a heuristic algorithm for the multi-armed bandit problem which has a long tradition in machine learning. The algorithm has a Bayesian spirit in the sense that it selects arms based on posterior samples of reward probabilities of each arm. By forging a connection between combinatorial binary bandits and spike-and-slab variable selection, we propose a stochastic optimization approach to subset selection called Thompson Variable Selection (TVS). TVS is a framework for interpretable machine learning which does not rely on the underlying model to be linear. TVS brings together Bayesian reinforcement and machine learning in order to extend the reach of Bayesian subset selection to non-parametric models and large datasets with very many predictors and/or very many observations. Depending on the choice of a reward, TVS can be deployed in offline as well as online setups…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics
