Refined approachability algorithms and application to regret minimization with global costs
Joon Kwon

TL;DR
This paper introduces refined approachability algorithms based on Follow the Regularized Leader, capable of minimizing various distance measures, and applies them to achieve new regret bounds in online learning with global costs.
Contribution
It extends approachability algorithms to minimize diverse distance metrics and provides the first explicit regret bounds for _p costs with dependence on p and dimension.
Findings
Developed a class of FTRL algorithms for approachability with flexible distance measures.
Achieved the first explicit regret bounds for _p costs depending on p and dimension.
Demonstrated applicability to online learning problems with global costs.
Abstract
Blackwell's approachability is a framework where two players, the Decision Maker and the Environment, play a repeated game with vector-valued payoffs. The goal of the Decision Maker is to make the average payoff converge to a given set called the target. When this is indeed possible, simple algorithms which guarantee the convergence are known. This abstract tool was successfully used for the construction of optimal strategies in various repeated games, but also found several applications in online learning. By extending an approach proposed by (Abernethy et al., 2011), we construct and analyze a class of Follow the Regularized Leader algorithms (FTRL) for Blackwell's approachability which are able to minimize not only the Euclidean distance to the target set (as it is often the case in the context of Blackwell's approachability) but a wide range of distance-like quantities. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
