Heavy-tailed Linear Bandits: Adversarial Robustness, Best-of-both-worlds, and Beyond

Canzhe Zhao; Shinji Ito; Shuai Li

arXiv:2508.13679·cs.LG·August 20, 2025

Heavy-tailed Linear Bandits: Adversarial Robustness, Best-of-both-worlds, and Beyond

Canzhe Zhao, Shinji Ito, Shuai Li

PDF

TL;DR

This paper introduces a novel framework for heavy-tailed bandit problems that achieves robust, optimal regret bounds in both stochastic and adversarial settings, extending to linear bandits with heavy-tailed noise.

Contribution

It proposes the first FTRL-based best-of-both-worlds algorithm for heavy-tailed bandits, including linear bandits, without requiring truncation assumptions.

Findings

01

Achieves $ ilde{O}(T^{1/\varepsilon})$ worst-case regret in adversarial regime.

02

Attains $ ilde{O}(\log T)$ regret in stochastic regime.

03

Introduces the HT-SPM algorithm with data-dependent learning rates for heavy-tailed bandits.

Abstract

Heavy-tailed bandits have been extensively studied since the seminal work of \citet{Bubeck2012BanditsWH}. In particular, heavy-tailed linear bandits, enabling efficient learning with both a large number of arms and heavy-tailed noises, have recently attracted significant attention \citep{ShaoYKL18,XueWWZ20,ZhongHYW21,Wang2025heavy,tajdini2025improved}. However, prior studies focus almost exclusively on stochastic regimes, with few exceptions limited to the special case of heavy-tailed multi-armed bandits (MABs) \citep{Huang0H22,ChengZ024,Chen2024uniINF}. In this work, we propose a general framework for adversarial heavy-tailed bandit problems, which performs follow-the-regularized-leader (FTRL) over the loss estimates shifted by a bonus function. Via a delicate setup of the bonus function, we devise the first FTRL-type best-of-both-worlds (BOBW) algorithm for heavy-tailed MABs, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.