Learning on the Edge: Online Learning with Stochastic Feedback Graphs
Emmanuel Esposito, Federico Fusco, Dirk van der Hoeven, Nicol\`o, Cesa-Bianchi

TL;DR
This paper introduces a new stochastic feedback graph model for online learning, providing nearly optimal regret bounds and algorithms that adapt to the graph's stochastic structure without prior knowledge.
Contribution
It extends feedback graph models to stochastic settings, deriving nearly optimal regret bounds and developing algorithms that adapt to the stochastic graph structure.
Findings
Achieves nearly optimal regret bounds in stochastic feedback graphs.
Develops algorithms that adapt without prior knowledge of the graph.
Provides improved bounds for specific graph structures.
Abstract
The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erd\H{o}s-R\'enyi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge. We prove nearly optimal regret bounds of order (ignoring logarithmic factors), where and are graph-theoretic quantities measured on the support of the stochastic feedback graph with edge probabilities thresholded at . Our result, which holds without any preliminary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Auction Theory and Applications
