Online Learning with Limited Information in the Sliding Window Model
Vladimir Braverman, Sumegha Garg, Chen Wang, David P. Woodruff, Samson Zhou

TL;DR
This paper develops algorithms for the experts problem in the sliding window model, achieving near-optimal regret with minimal memory and query complexity, and extends results to bandit problems in data streams.
Contribution
It introduces memory-efficient algorithms for sliding window experts problems with multiple queries, and provides the first sublinear regret algorithm for bandit problems in streaming settings.
Findings
Achieves near-optimal regret with 2 queries and polylogarithmic memory.
Provides exponential memory improvement over previous interval regret algorithms.
First sublinear regret algorithm for bandit problems in streaming with polylogarithmic memory.
Abstract
Motivated by recent work on the experts problem in the streaming model, we consider the experts problem in the sliding window model. The sliding window model is a well-studied model that captures applications such as traffic monitoring, epidemic tracking, and automated trading, where recent information is more valuable than older data. Formally, we have experts, days, the ability to query the predictions of experts on each day, a limited amount of memory, and should achieve the (near-)optimal regret regret over any window of the last days. While it is impossible to achieve such regret with query, we show that with queries we can achieve such regret and with only bits of memory. Not only are our algorithms optimal for sliding windows, but we also show for every interval of days that we achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Age of Information Optimization
