A Jointly Efficient and Optimal Algorithm for Heteroskedastic Generalized Linear Bandits with Adversarial Corruptions

Sanghwa Kim; Junghyun Lee; Se-Young Yun

arXiv:2602.10971·cs.LG·February 12, 2026

A Jointly Efficient and Optimal Algorithm for Heteroskedastic Generalized Linear Bandits with Adversarial Corruptions

Sanghwa Kim, Junghyun Lee, Se-Young Yun

PDF

Open Access

TL;DR

This paper introduces HCW-GLB-OMD, an efficient algorithm for heteroskedastic generalized linear bandits with adversarial corruptions, achieving near-optimal regret bounds across diverse settings.

Contribution

The paper proposes a novel, computationally efficient algorithm that unifies and improves regret bounds for heteroskedastic GLBs under adversarial corruptions.

Findings

01

Achieves regret bounds close to the lower bound, demonstrating near-optimality.

02

Unifies various heteroskedastic bandit settings under a single framework.

03

Maintains computational efficiency with O(1) complexity per iteration.

Abstract

We consider the problem of heteroskedastic generalized linear bandits (GLBs) with adversarial corruptions, which subsumes various stochastic contextual bandit settings, including heteroskedastic linear bandits and logistic/Poisson bandits. We propose HCW-GLB-OMD, which consists of two components: an online mirror descent (OMD)-based estimator and Hessian-based confidence weights to achieve corruption robustness. This is computationally efficient in that it only requires $O (1)$ space and time complexity per iteration. Under the self-concordance assumption on the link function, we show a regret bound of $\tilde{O} (d \sum_{t} g (τ_{t}) \overset{μ}{˙}_{t, ⋆} + d^{2} g_{m a x} κ + d κ C)$ , where $\overset{μ}{˙}_{t, ⋆}$ is the slope of $μ$ around the optimal arm at time $t$ , $g (τ_{t})$ 's are potentially exogenously time-varying dispersions (e.g., $g(\tau_t) =…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Mobile Crowdsensing and Crowdsourcing