Trading Off Resource Budgets for Improved Regret Bounds
Damon Falck, Thomas Orton

TL;DR
This paper introduces FPML, an algorithm that balances resource budgets and regret bounds in adversarial online learning, with applications to submodular maximization, hyperparameter tuning, and linear programming.
Contribution
The paper proposes FPML, a novel algorithm that achieves a trade-off between resource budgets and regret bounds, extending online learning applications.
Findings
Achieves regret of O(T^{1/(B+1)} log(N)^{B/(B+1)}) with FPML.
Generalizes algorithms for online submodular maximization using FPML.
Empirically effective in hyperparameter optimization and linear programming.
Abstract
In this work we consider a variant of adversarial online learning where in each round one picks out of arms and incurs cost equal to the of the costs of each arm chosen. We propose an algorithm called Follow the Perturbed Multiple Leaders (FPML) for this problem, which we show (by adapting the techniques of Kalai and Vempala [2005]) achieves expected regret over time horizon relative to the best arm in hindsight. This introduces a trade-off between the budget and the single-best-arm regret, and we proceed to investigate several applications of this trade-off. First, we observe that algorithms which use standard regret minimizers as subroutines can sometimes be adapted by replacing these subroutines with FPML, and we use this to generalize existing algorithms for Online Submodular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
