Adversarial Online Learning with noise

Alon Resler; Yishay Mansour

arXiv:1810.09346·cs.LG·November 6, 2018

Adversarial Online Learning with noise

Alon Resler, Yishay Mansour

PDF

Open Access

TL;DR

This paper investigates adversarial online learning models with noisy feedback, providing tight regret bounds for scenarios with both constant and variable Bernoulli noise rates in full information and bandit settings.

Contribution

It introduces models of adversarial online learning with noisy feedback and derives tight regret bounds for these settings, extending understanding of learning under noise.

Findings

01

Tight regret bounds established for noisy feedback scenarios.

02

Analysis covers both full information and bandit feedback.

03

Results include constant and variable Bernoulli noise rates.

Abstract

We present and study models of adversarial online learning where the feedback observed by the learner is noisy, and the feedback is either full information feedback or bandit feedback. Specifically, we consider binary losses xored with the noise, which is a Bernoulli random variable. We consider both a constant noise rate and a variable noise rate. Our main results are tight regret bounds for learning with noise in the adversarial online learning model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems