A Jointly Efficient and Optimal Algorithm for Heteroskedastic Generalized Linear Bandits with Adversarial Corruptions
Sanghwa Kim, Junghyun Lee, Se-Young Yun

TL;DR
This paper introduces HCW-GLB-OMD, an efficient algorithm for heteroskedastic generalized linear bandits with adversarial corruptions, achieving near-optimal regret bounds across diverse settings.
Contribution
The paper proposes a novel, computationally efficient algorithm that unifies and improves regret bounds for heteroskedastic GLBs under adversarial corruptions.
Findings
Achieves regret bounds close to the lower bound, demonstrating near-optimality.
Unifies various heteroskedastic bandit settings under a single framework.
Maintains computational efficiency with O(1) complexity per iteration.
Abstract
We consider the problem of heteroskedastic generalized linear bandits (GLBs) with adversarial corruptions, which subsumes various stochastic contextual bandit settings, including heteroskedastic linear bandits and logistic/Poisson bandits. We propose HCW-GLB-OMD, which consists of two components: an online mirror descent (OMD)-based estimator and Hessian-based confidence weights to achieve corruption robustness. This is computationally efficient in that it only requires space and time complexity per iteration. Under the self-concordance assumption on the link function, we show a regret bound of , where is the slope of around the optimal arm at time , 's are potentially exogenously time-varying dispersions (e.g., $g(\tau_t) =…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Mobile Crowdsensing and Crowdsourcing
