Note on Follow-the-Perturbed-Leader in Combinatorial Semi-Bandit Problems

Botao Chen; Junya Honda

arXiv:2506.12490·cs.LG·July 23, 2025

Note on Follow-the-Perturbed-Leader in Combinatorial Semi-Bandit Problems

Botao Chen, Junya Honda

PDF

Open Access

TL;DR

This paper analyzes the optimality and computational complexity of Follow-the-Perturbed-Leader (FTPL) in combinatorial semi-bandit problems, demonstrating regret bounds and efficiency improvements with geometric resampling techniques.

Contribution

It establishes regret bounds for FTPL with geometric resampling in semi-bandit settings and introduces an extension to reduce computational complexity.

Findings

01

FTPL achieves near-optimal regret bounds in semi-bandit problems.

02

Conditional geometric resampling reduces computational complexity from O(d^2) to O(md(log(d/m)+1)).

03

FTPL with geometric resampling attains optimal regret bounds in adversarial settings.

Abstract

This paper studies the optimality and complexity of Follow-the-Perturbed-Leader (FTPL) policy in size-invariant combinatorial semi-bandit problems. Recently, Honda et al. (2023) and Lee et al. (2024) showed that FTPL achieves Best-of-Both-Worlds (BOBW) optimality in standard multi-armed bandit problems with Fr\'{e}chet-type distributions. However, the optimality of FTPL in combinatorial semi-bandit problems remains unclear. In this paper, we consider the regret bound of FTPL with geometric resampling (GR) in size-invariant semi-bandit setting, showing that FTPL respectively achieves $O (m^{2} d^{\frac{1}{α}} T + m d T)$ regret with Fr\'{e}chet distributions, and the best possible regret bound of $O (m d T)$ with Pareto distributions in adversarial setting. Furthermore, we extend the conditional geometric resampling (CGR) to size-invariant semi-bandit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Auction Theory and Applications