Scalable Generalized Linear Bandits: Online Computation and Hashing

Kwang-Sung Jun; Aniruddha Bhargava; Robert Nowak; Rebecca Willett

arXiv:1706.00136·stat.ML·October 24, 2017·31 cites

Scalable Generalized Linear Bandits: Online Computation and Hashing

Kwang-Sung Jun, Aniruddha Bhargava, Robert Nowak, Rebecca Willett

PDF

Open Access

TL;DR

This paper introduces scalable algorithms for generalized linear bandits that achieve constant per-step complexity and sublinear time in the number of arms, using online learning extensions and hashing techniques.

Contribution

It presents a novel GLOC method for scalable GLBs, a hash-amenable algorithm with improved regret bounds, and a fast approximate hash-key computation method.

Findings

01

Constant space and time complexity per step.

02

Sublinear time complexity in the number of arms.

03

Improved regret bounds with hash-amenable algorithms.

Abstract

Generalized Linear Bandits (GLBs), a natural extension of the stochastic linear bandits, has been popular and successful in recent years. However, existing GLBs scale poorly with the number of rounds and the number of arms, limiting their utility in practice. This paper proposes new, scalable solutions to the GLB problem in two respects. First, unlike existing GLBs, whose per-time-step space and time complexity grow at least linearly with time $t$ , we propose a new algorithm that performs online computations to enjoy a constant space and time complexity. At its heart is a novel Generalized Linear extension of the Online-to-confidence-set Conversion (GLOC method) that takes \emph{any} online learning algorithm and turns it into a GLB algorithm. As a special case, we apply GLOC to the online Newton step algorithm, which results in a low-regret GLB algorithm with much lower time and memory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques