Pseudonorm Approachability and Applications to Regret Minimization

Christoph Dann; Yishay Mansour; Mehryar Mohri; Jon Schneider,; Balasubramanian Sivan

arXiv:2302.01517·cs.LG·February 6, 2023·1 cites

Pseudonorm Approachability and Applications to Regret Minimization

Christoph Dann, Yishay Mansour, Mehryar Mohri, Jon Schneider,, Balasubramanian Sivan

PDF

Open Access

TL;DR

This paper introduces a novel framework for low-dimensional pseudonorm approachability, enabling efficient $ ext{l}_ ext{infinity}$-approachability algorithms with convergence rates independent of high-dimensional payoff spaces.

Contribution

It develops a pseudonorm approachability theory, reducing high-dimensional $ ext{l}_ ext{infinity}$ problems to low-dimensional ones, and provides algorithms with dimension-independent convergence.

Findings

01

Dimension-independent convergence for $ ext{l}_ ext{infinity}$-approachability algorithms.

02

Polynomial-time complexity assuming efficient $ ext{l}_ ext{infinity}$ distance computation.

03

Logarithmic convergence algorithms using FTRL with maximum-entropy regularizer.

Abstract

Blackwell's celebrated approachability theory provides a general framework for a variety of learning problems, including regret minimization. However, Blackwell's proof and implicit algorithm measure approachability using the $ℓ_{2}$ (Euclidean) distance. We argue that in many applications such as regret minimization, it is more useful to study approachability under other distance metrics, most commonly the $ℓ_{\infty}$ -metric. But, the time and space complexity of the algorithms designed for $ℓ_{\infty}$ -approachability depend on the dimension of the space of the vectorial payoffs, which is often prohibitively large. Thus, we present a framework for converting high-dimensional $ℓ_{\infty}$ -approachability problems to low-dimensional pseudonorm approachability problems, thereby resolving such issues. We first show that the $ℓ_{\infty}$ -distance between the average payoff and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification