Variable Selection via Thompson Sampling

Yi Liu; Veronika Rockova

arXiv:2007.00187·cs.LG·February 15, 2021

Variable Selection via Thompson Sampling

Yi Liu, Veronika Rockova

PDF

Open Access

TL;DR

This paper introduces Thompson Variable Selection (TVS), a Bayesian-inspired stochastic method for subset selection that is flexible, robust, and effective for high-dimensional, non-parametric machine learning models, with strong empirical results.

Contribution

The paper proposes TVS, a novel stochastic optimization framework for variable selection that extends Bayesian methods to non-parametric models and large datasets, applicable in offline and online settings.

Findings

01

Strong empirical performance on simulated data

02

Robustness due to stochastic approach, less prone to local convergence

03

Regret bounds for bandit-based variable selection

Abstract

Thompson sampling is a heuristic algorithm for the multi-armed bandit problem which has a long tradition in machine learning. The algorithm has a Bayesian spirit in the sense that it selects arms based on posterior samples of reward probabilities of each arm. By forging a connection between combinatorial binary bandits and spike-and-slab variable selection, we propose a stochastic optimization approach to subset selection called Thompson Variable Selection (TVS). TVS is a framework for interpretable machine learning which does not rely on the underlying model to be linear. TVS brings together Bayesian reinforcement and machine learning in order to extend the reach of Bayesian subset selection to non-parametric models and large datasets with very many predictors and/or very many observations. Depending on the choice of a reward, TVS can be deployed in offline as well as online setups…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics