uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
Yu Chen, Jiatai Huang, Yan Dai, Longbo Huang

TL;DR
uniINF is a novel parameter-free algorithm for heavy-tailed multi-armed bandits that performs optimally in both stochastic and adversarial settings without prior knowledge of environment parameters.
Contribution
It introduces the first parameter-free, Best-of-Both-Worlds algorithm for heavy-tailed MABs, extending robustness to adversarial environments and eliminating the need for known tail parameters.
Findings
Achieves nearly-optimal regret in stochastic and adversarial environments.
First parameter-free algorithm with BoBW guarantees for heavy-tailed MABs.
Develops new techniques like auto-balancing learning rates and adaptive loss tuning.
Abstract
In this paper, we present a novel algorithm, uniINF, for the Heavy-Tailed Multi-Armed Bandits (HTMAB) problem, demonstrating robustness and adaptability in both stochastic and adversarial environments. Unlike the stochastic MAB setting where loss distributions are stationary with time, our study extends to the adversarial setup, where losses are generated from heavy-tailed distributions that depend on both arms and time. Our novel algorithm `uniINF` enjoys the so-called Best-of-Both-Worlds (BoBW) property, performing optimally in both stochastic and adversarial environments without knowing the exact environment type. Moreover, our algorithm also possesses a Parameter-Free feature, i.e., it operates without the need of knowing the heavy-tail parameters a-priori. To be precise, uniINF ensures nearly-optimal regret in both stochastic and adversarial environments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Surface Polishing Techniques · Advanced Numerical Methods in Computational Mathematics
