Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Emmanuel Esposito; Federico Fusco; Dirk van der Hoeven; Nicol\`o; Cesa-Bianchi

arXiv:2210.04229·cs.LG·February 20, 2024

Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Emmanuel Esposito, Federico Fusco, Dirk van der Hoeven, Nicol\`o, Cesa-Bianchi

PDF

Open Access 1 Video

TL;DR

This paper introduces a new stochastic feedback graph model for online learning, providing nearly optimal regret bounds and algorithms that adapt to the graph's stochastic structure without prior knowledge.

Contribution

It extends feedback graph models to stochastic settings, deriving nearly optimal regret bounds and developing algorithms that adapt to the stochastic graph structure.

Findings

01

Achieves nearly optimal regret bounds in stochastic feedback graphs.

02

Develops algorithms that adapt without prior knowledge of the graph.

03

Provides improved bounds for specific graph structures.

Abstract

The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erd\H{o}s-R\'enyi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge. We prove nearly optimal regret bounds of order $min {min_{ε} (α_{ε} / ε) T, min_{ε} (δ_{ε} / ε)^{1/3} T^{2/3}}$ (ignoring logarithmic factors), where $α_{ε}$ and $δ_{ε}$ are graph-theoretic quantities measured on the support of the stochastic feedback graph $G$ with edge probabilities thresholded at $ε$ . Our result, which holds without any preliminary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning on the Edge: Online Learning with Stochastic Feedback Graphs· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Auction Theory and Applications