Trading Off Resource Budgets for Improved Regret Bounds

Damon Falck; Thomas Orton

arXiv:2210.05789·cs.LG·October 13, 2022

Trading Off Resource Budgets for Improved Regret Bounds

Damon Falck, Thomas Orton

PDF

Open Access 1 Video

TL;DR

This paper introduces FPML, an algorithm that balances resource budgets and regret bounds in adversarial online learning, with applications to submodular maximization, hyperparameter tuning, and linear programming.

Contribution

The paper proposes FPML, a novel algorithm that achieves a trade-off between resource budgets and regret bounds, extending online learning applications.

Findings

01

Achieves regret of O(T^{1/(B+1)} log(N)^{B/(B+1)}) with FPML.

02

Generalizes algorithms for online submodular maximization using FPML.

03

Empirically effective in hyperparameter optimization and linear programming.

Abstract

In this work we consider a variant of adversarial online learning where in each round one picks $B$ out of $N$ arms and incurs cost equal to the $minimum$ of the costs of each arm chosen. We propose an algorithm called Follow the Perturbed Multiple Leaders (FPML) for this problem, which we show (by adapting the techniques of Kalai and Vempala [2005]) achieves expected regret $O (T^{\frac{1}{B + 1}} ln (N)^{\frac{B}{B + 1}})$ over time horizon $T$ relative to the $single$ best arm in hindsight. This introduces a trade-off between the budget $B$ and the single-best-arm regret, and we proceed to investigate several applications of this trade-off. First, we observe that algorithms which use standard regret minimizers as subroutines can sometimes be adapted by replacing these subroutines with FPML, and we use this to generalize existing algorithms for Online Submodular…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Trading Off Resource Budgets For Improved Regret Bounds· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning