Adversaries in Online Learning Revisited: with applications in Robust   Optimization and Adversarial training

Sebastian Pokutta; Huan Xu

arXiv:2101.11443·cs.LG·January 28, 2021

Adversaries in Online Learning Revisited: with applications in Robust Optimization and Adversarial training

Sebastian Pokutta, Huan Xu

PDF

Open Access

TL;DR

This paper clarifies the concept of adversaries in online learning, distinguishes between anticipative and non-anticipative adversaries, and applies this understanding to develop a general approach for robust optimization and adversarial training using a game-theoretic framework.

Contribution

It introduces a rigorous distinction between types of adversaries in online learning and develops a meta-game approach for robust optimization and adversarial training.

Findings

01

Different types of adversaries affect regret guarantees.

02

The proposed meta-game approach effectively solves robust optimization problems.

03

The method generalizes previous approaches like arXiv:1402.6361.

Abstract

We revisit the concept of "adversary" in online learning, motivated by solving robust optimization and adversarial training using online learning methods. While one of the classical setups in online learning deals with the "adversarial" setup, it appears that this concept is used less rigorously, causing confusion in applying results and insights from online learning. Specifically, there are two fundamentally different types of adversaries, depending on whether the "adversary" is able to anticipate the exogenous randomness of the online learning algorithms. This is particularly relevant to robust optimization and adversarial training because the adversarial sequences are often anticipative, and many online learning algorithms do not achieve diminishing regret in such a case. We then apply this to solving robust optimization problems or (equivalently) adversarial training problems via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Risk and Portfolio Optimization · Reinforcement Learning in Robotics