A Unified Confidence Sequence for Generalized Linear Models, with   Applications to Bandits

Junghyun Lee; Se-Young Yun; Kwang-Sung Jun

arXiv:2407.13977·stat.ML·January 16, 2025

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

Junghyun Lee, Se-Young Yun, Kwang-Sung Jun

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a unified, tight confidence sequence for generalized linear models, enabling improved bandit algorithms with state-of-the-art regret bounds and practical performance.

Contribution

It develops a novel likelihood ratio-based confidence sequence for GLMs, including Bernoulli, and applies it to create an optimistic bandit algorithm with improved regret guarantees.

Findings

01

Unified confidence sequence for various GLMs.

02

State-of-the-art regret bounds for generalized linear bandits.

03

Numerical results show competitive or superior performance.

Abstract

We present a unified likelihood ratio-based confidence sequence (CS) for any (self-concordant) generalized linear model (GLM) that is guaranteed to be convex and numerically tight. We show that this is on par or improves upon known CSs for various GLMs, including Gaussian, Bernoulli, and Poisson. In particular, for the first time, our CS for Bernoulli has a $poly (S)$ -free radius where $S$ is the norm of the unknown parameter. Our first technical novelty is its derivation, which utilizes a time-uniform PAC-Bayesian bound with a uniform prior/posterior, despite the latter being a rather unpopular choice for deriving CSs. As a direct application of our new CS, we propose a simple and natural optimistic algorithm called OFUGLB, applicable to any generalized linear bandits (GLB; Filippi et al. (2010)). Our analysis shows that the celebrated optimistic approach simultaneously attains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nick-jhlee/logistic_bandit
noneOfficial

Videos

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Data Stream Mining Techniques