uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed   MABs

Yu Chen; Jiatai Huang; Yan Dai; Longbo Huang

arXiv:2410.03284·cs.LG·February 26, 2025

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Yu Chen, Jiatai Huang, Yan Dai, Longbo Huang

PDF

Open Access

TL;DR

uniINF is a novel parameter-free algorithm for heavy-tailed multi-armed bandits that performs optimally in both stochastic and adversarial settings without prior knowledge of environment parameters.

Contribution

It introduces the first parameter-free, Best-of-Both-Worlds algorithm for heavy-tailed MABs, extending robustness to adversarial environments and eliminating the need for known tail parameters.

Findings

01

Achieves nearly-optimal regret in stochastic and adversarial environments.

02

First parameter-free algorithm with BoBW guarantees for heavy-tailed MABs.

03

Develops new techniques like auto-balancing learning rates and adaptive loss tuning.

Abstract

In this paper, we present a novel algorithm, uniINF, for the Heavy-Tailed Multi-Armed Bandits (HTMAB) problem, demonstrating robustness and adaptability in both stochastic and adversarial environments. Unlike the stochastic MAB setting where loss distributions are stationary with time, our study extends to the adversarial setup, where losses are generated from heavy-tailed distributions that depend on both arms and time. Our novel algorithm `uniINF` enjoys the so-called Best-of-Both-Worlds (BoBW) property, performing optimally in both stochastic and adversarial environments without knowing the exact environment type. Moreover, our algorithm also possesses a Parameter-Free feature, i.e., it operates without the need of knowing the heavy-tail parameters $(σ, α)$ a-priori. To be precise, uniINF ensures nearly-optimal regret in both stochastic and adversarial environments,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Surface Polishing Techniques · Advanced Numerical Methods in Computational Mathematics