Refined approachability algorithms and application to regret   minimization with global costs

Joon Kwon

arXiv:2009.03831·cs.LG·September 8, 2021

Refined approachability algorithms and application to regret minimization with global costs

Joon Kwon

PDF

Open Access

TL;DR

This paper introduces refined approachability algorithms based on Follow the Regularized Leader, capable of minimizing various distance measures, and applies them to achieve new regret bounds in online learning with global costs.

Contribution

It extends approachability algorithms to minimize diverse distance metrics and provides the first explicit regret bounds for _p costs with dependence on p and dimension.

Findings

01

Developed a class of FTRL algorithms for approachability with flexible distance measures.

02

Achieved the first explicit regret bounds for _p costs depending on p and dimension.

03

Demonstrated applicability to online learning problems with global costs.

Abstract

Blackwell's approachability is a framework where two players, the Decision Maker and the Environment, play a repeated game with vector-valued payoffs. The goal of the Decision Maker is to make the average payoff converge to a given set called the target. When this is indeed possible, simple algorithms which guarantee the convergence are known. This abstract tool was successfully used for the construction of optimal strategies in various repeated games, but also found several applications in online learning. By extending an approach proposed by (Abernethy et al., 2011), we construct and analyze a class of Follow the Regularized Leader algorithms (FTRL) for Blackwell's approachability which are able to minimize not only the Euclidean distance to the target set (as it is often the case in the context of Blackwell's approachability) but a wide range of distance-like quantities. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems