Online Learning with Limited Information in the Sliding Window Model

Vladimir Braverman; Sumegha Garg; Chen Wang; David P. Woodruff; Samson Zhou

arXiv:2601.03533·stat.ML·January 8, 2026

Open Access

TL;DR

This paper develops algorithms for the experts problem in the sliding window model that achieve near-optimal regret over every window of the last $W$ days using 2 expert queries per day and $\mathrm{polylog}(nT)$ bits of memory, and it extends the results to bandit problems in data streams.

Contribution

It introduces memory-efficient algorithms for sliding window experts problems with multiple queries, and provides the first sublinear regret algorithm for bandit problems in streaming settings.

Findings

01. Achieves near-optimal regret with 2 queries per day and $\mathrm{polylog}(nT)$ bits of memory.

02. Provides an exponential memory improvement over previous interval regret algorithms.

03. Gives the first sublinear regret algorithm for bandit problems in the streaming setting with polylogarithmic memory.

Abstract

Motivated by recent work on the experts problem in the streaming model, we consider the experts problem in the sliding window model. The sliding window model is a well-studied model that captures applications such as traffic monitoring, epidemic tracking, and automated trading, where recent information is more valuable than older data. Formally, we have $n$ experts, $T$ days, the ability to query the predictions of $q$ experts on each day, a limited amount of memory, and should achieve the (near-)optimal regret of $\sqrt{nW}\,\mathrm{polylog}(nT)$ over any window of the last $W$ days. While it is impossible to achieve such regret with $1$ query, we show that with $2$ queries we can achieve such regret with only $\mathrm{polylog}(nT)$ bits of memory. Not only are our algorithms optimal for sliding windows, but we also show for every interval $I$ of days that we achieve…
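To make the regret notion in the abstract concrete, here is a minimal Python sketch of how regret over the last $W$ days could be measured: the learner's total loss in the window minus the loss of the best single expert in that same window. The function name `window_regret`, the loss matrix layout, and the toy data are illustrative assumptions, not taken from the paper, and the sketch only evaluates regret after the fact; it is not the paper's low-memory algorithm.

```python
from typing import Sequence


def window_regret(
    losses: Sequence[Sequence[int]],  # losses[t][i]: loss of expert i on day t (assumed layout)
    picks: Sequence[int],             # picks[t]: expert the learner followed on day t
    W: int,                           # window length in days
) -> int:
    """Learner's loss minus the best single expert's loss over the last W days."""
    T = len(losses)
    n = len(losses[0])
    start = max(0, T - W)  # first day inside the window
    learner_loss = sum(losses[t][picks[t]] for t in range(start, T))
    best_expert_loss = min(
        sum(losses[t][i] for t in range(start, T)) for i in range(n)
    )
    return learner_loss - best_expert_loss


# Toy data (hypothetical): 2 experts, 4 days, 0/1 losses.
losses = [[0, 1], [1, 0], [1, 0], [1, 0]]
picks = [0, 0, 1, 1]

regret_last_3 = window_regret(losses, picks, W=3)
regret_last_4 = window_regret(losses, picks, W=4)
```

Note that this evaluator stores the full loss history, which is exactly what a memory-limited sliding window algorithm cannot afford; it is meant only to pin down the quantity being bounded.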

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Age of Information Optimization