Minimax Optimal Quantile and Semi-Adversarial Regret via   Root-Logarithmic Regularizers

Jeffrey Negrea; Blair Bilodeau; Nicol\`o Campolongo; Francesco; Orabona; Daniel M. Roy

arXiv:2110.14804·stat.ML·November 9, 2021

Minimax Optimal Quantile and Semi-Adversarial Regret via Root-Logarithmic Regularizers

Jeffrey Negrea, Blair Bilodeau, Nicol\`o Campolongo, Francesco, Orabona, Daniel M. Roy

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces root-logarithmic regularizers in FTRL algorithms to achieve minimax optimal quantile and semi-adversarial regret bounds, extending theoretical guarantees to broader settings and improving adaptivity.

Contribution

It develops novel regularizers for FTRL that attain minimax optimal regret in quantile and semi-adversarial paradigms, extending bounds to uncountable expert classes and providing tight lower bounds.

Findings

01

Achieves minimax optimal regret bounds in both paradigms.

02

Extends KL regret bounds to uncountable expert classes with arbitrary priors.

03

Provides tight lower bounds for quantile regret on finite expert classes.

Abstract

Quantile (and, more generally, KL) regret bounds, such as those achieved by NormalHedge (Chaudhuri, Freund, and Hsu 2009) and its variants, relax the goal of competing against the best individual expert to only competing against a majority of experts on adversarial data. More recently, the semi-adversarial paradigm (Bilodeau, Negrea, and Roy 2020) provides an alternative relaxation of adversarial online learning by considering data that may be neither fully adversarial nor stochastic (i.i.d.). We achieve the minimax optimal regret in both paradigms using FTRL with separate, novel, root-logarithmic regularizers, both of which can be interpreted as yielding variants of NormalHedge. We extend existing KL regret upper bounds, which hold uniformly over target distributions, to possibly uncountable expert classes with arbitrary priors; provide the first full-information lower bounds for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

blairbilodeau/neurips-2021
noneOfficial

Videos

Minimax Optimal Quantile and Semi-Adversarial Regret via Root-Logarithmic Regularizers· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning