Replicable Online Learning

Saba Ahmadi; Siddharth Bhandari; Avrim Blum

arXiv:2411.13730·cs.LG·November 22, 2024

TL;DR

This paper introduces adversarially replicable online learning algorithms: algorithms that, with high probability, produce the same sequence of actions across independent runs, even when the inputs are drawn from time-varying distributions chosen by an oblivious adversary. It provides algorithms, a general conversion framework, and regret bounds for this setting.

Contribution

The paper extends the notion of replicability to adversarial online settings, develops adversarially replicable algorithms for online linear optimization and the experts problem, and establishes both regret upper bounds and regret lower bounds for such algorithms.

Findings

01. Developed adversarially replicable algorithms with sub-linear regret.

02. Created a framework to convert online learners into adversarially replicable algorithms.

03. Established regret lower bounds for replicable online algorithms.

Abstract

We investigate the concept of algorithmic replicability introduced by Impagliazzo et al. (2022), Ghazi et al. (2021), and Ahn et al. (2024) in an online setting. In our model, the input sequence received by the online learner is generated from time-varying distributions chosen obliviously by an adversary. Our objective is to design low-regret online algorithms that, with high probability, produce the exact same sequence of actions when run on two independently sampled input sequences generated as described above. We refer to such algorithms as adversarially replicable. Previous works (such as Esfandiari et al. 2022) explored replicability in the online setting under inputs generated independently from a fixed distribution; we call this notion iid-replicability. Our model generalizes to capture both adversarial and iid input sequences, as well as their mixtures, which can be modeled by…
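The replicability requirement described in the abstract can be probed empirically. The following is a minimal sketch, not the paper's algorithm: it estimates how often a learner produces the exact same action sequence on two independently sampled input sequences drawn from the same adversary-chosen, time-varying distributions. All names here (`sample_losses`, `follow_the_leader`, `constant_learner`, `replicability_rate`) are hypothetical, the experts-style Bernoulli loss model is chosen only for illustration, and sharing the learner's internal random seed across the two runs is an assumption borrowed from the usual replicability definition.

```python
import random


def sample_losses(means, rng):
    """Draw one loss sequence.

    means[t][k] is the Bernoulli mean the (oblivious) adversary fixed in
    advance for expert k in round t. This loss model is an illustrative
    assumption, not taken from the paper.
    """
    return [[1.0 if rng.random() < m else 0.0 for m in row] for row in means]


def follow_the_leader(losses, internal_rng):
    """Toy learner: each round, play the expert with the smallest cumulative loss.

    The action for round t uses only losses from rounds before t.
    internal_rng is unused; it is in the signature so that randomized
    learners fit the same interface.
    """
    k = len(losses[0])
    totals = [0.0] * k
    actions = []
    for row in losses:
        # Ties are broken toward the lowest expert index.
        actions.append(min(range(k), key=lambda i: (totals[i], i)))
        totals = [a + b for a, b in zip(totals, row)]
    return actions


def constant_learner(losses, internal_rng):
    """Trivially replicable learner: always plays expert 0 (and ignores regret)."""
    return [0] * len(losses)


def replicability_rate(learner, means, trials=200, seed=0):
    """Fraction of trials in which two runs yield identical action sequences."""
    data_rng = random.Random(seed)
    matches = 0
    for trial in range(trials):
        # Two independent samples from the same sequence of distributions.
        first = sample_losses(means, data_rng)
        second = sample_losses(means, data_rng)
        # Assumption: both runs share the learner's internal randomness.
        actions_first = learner(first, random.Random(trial))
        actions_second = learner(second, random.Random(trial))
        matches += actions_first == actions_second
    return matches / trials
```

Calling `replicability_rate(constant_learner, means)` returns 1.0 for any `means`, since that learner ignores its input, while `replicability_rate(follow_the_leader, means)` on noisy inputs will generally be lower because small differences between the two samples can flip the leader. The sketch only measures replicability; it says nothing about regret, which the paper's algorithms must control at the same time.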
