Online learning with Erd\H{o}s-R\'enyi side-observation graphs

Tom\'a\v{s} Koc\'ak; Gergely Neu; Michal Valko

arXiv:2604.25271·stat.ML·April 29, 2026·2 cites

Online learning with Erd\H{o}s-R\'enyi side-observation graphs

Tom\'a\v{s} Koc\'ak, Gergely Neu, Michal Valko

PDF

1 Datasets

TL;DR

This paper introduces algorithms for adversarial multi-armed bandit problems with side observations, achieving near-optimal regret bounds depending on the probability of observing additional arm losses.

Contribution

The paper proposes two algorithms tailored for different observation probabilities, providing near-optimal regret bounds in adversarial bandit settings with side observations.

Findings

01

First algorithm achieves $O(\sqrt{(T /r) \log N })$ regret for $r \ge (\log T)/(2N)$.

02

Second algorithm achieves $O(\sqrt{(T/r) \\log (N+T)})$ regret for smaller $r$.

03

A quick estimation procedure determines the relevant range of $r$.

Abstract

We consider adversarial multi-armed bandit problems where the learner is allowed to observe losses of a number of arms beside the arm that it actually chose. We study the case where all non-chosen arms reveal their loss with a fixed but unknown probability $r$ , independently of each other and the action of the learner. We propose two algorithms that work for different ranges of $r$ . We show that after $T$ rounds in a bandit problem with $N$ arms, the expected regret of our first algorithm is $O ((T / r) lo g N)$ whenever $r \geq (lo g T) / (2 N)$ , while our second algorithm achieves a regret of $O ((T / r) lo g (N + T))$ for smaller values of $r$ . We also give a quick estimation procedure that decides the range of~ $r$ . All our bounds are within logarithmic factors of the best achievable performance of any algorithm that is even allowed to know~ $r$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

misovalko/my-research-papers
dataset· 103 dl
103 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.