Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making
Xin Chen, I-Hong Hou

TL;DR
This paper introduces Contextual Restless Bandits, a framework for online decision-making that combines contextual bandits with restless bandits, together with a scalable index policy algorithm and an application to demand response in smart grids.
Contribution
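It develops the Contextual Restless Bandits (CRB) framework, derives a scalable index policy algorithm via dual decomposition, analyzes the algorithm's asymptotic optimality, proposes a model-based online learning algorithm for unknown arm models, and applies the approach to demand response in smart grids.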
Findings
The proposed algorithms are scalable and efficient.
Theoretical analysis confirms asymptotic optimality.
Numerical simulations demonstrate strong performance in demand response.
Abstract
This paper introduces a novel multi-armed bandit framework, termed Contextual Restless Bandits (CRB), for complex online decision-making. The CRB framework incorporates the core features of contextual bandits and restless bandits, so that it can model both the internal state transitions of each arm and the influence of external global environmental contexts. Using the dual decomposition method, we develop a scalable index policy algorithm for solving the CRB problem and theoretically analyze the asymptotic optimality of this algorithm. When the arm models are unknown, we further propose a model-based online learning algorithm, built on the index policy, that learns the arm models and makes decisions simultaneously. Furthermore, we apply the proposed CRB framework and the index policy algorithm specifically to the demand response decision-making problem in smart grids. The…
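To make the idea of an index policy over arms with internal states and a shared global context more concrete, here is a minimal Python sketch. It is not the authors' algorithm: it assumes a discounted objective, a context that evolves as a Markov chain independent of the actions, a fixed activation price `lam` supplied by hand (the paper obtains its multipliers through dual decomposition), and a simple "advantage of acting" index. All names (`single_arm_q`, `index_policy`, `P`, `R`, `C`, `lam`, `budget`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def single_arm_q(P, R, C, lam, gamma=0.95, iters=500):
    """Q-values of one arm's relaxed problem with activation price lam.

    P: (A, S, S) arm state transitions per action (0 = passive, 1 = active)
    R: (K, S, A) reward under context k, arm state s, action a
    C: (K, K)    context transition matrix (assumed action-independent)
    Returns an array Q of shape (K, S, A).
    """
    K, S, A = R.shape
    # Charge lam for the active action only (action index 1).
    penalized = R - lam * np.arange(A)
    V = np.zeros((K, S))
    for _ in range(iters):
        # CV[k, s'] = expected next value over the next context k'.
        CV = C @ V
        # EV[k, s, a] = sum over next arm state s' of P[a, s, s'] * CV[k, s'].
        EV = np.einsum("ast,kt->ksa", P, CV)
        Q = penalized + gamma * EV
        V = Q.max(axis=2)
    return Q

def index_policy(Qs, states, context, budget):
    """Activate the `budget` arms whose advantage of acting is largest."""
    index = np.array([Q[context, s, 1] - Q[context, s, 0]
                      for Q, s in zip(Qs, states)])
    chosen = np.argsort(-index)[:budget]
    actions = np.zeros(len(states), dtype=int)
    actions[chosen] = 1
    return actions

# Illustrative usage with randomly generated (made-up) arm models.
rng = np.random.default_rng(0)
N, K, S, A = 5, 2, 3, 2
C = rng.dirichlet(np.ones(K), size=K)                  # (K, K)
arms_P = [rng.dirichlet(np.ones(S), size=(A, S)) for _ in range(N)]
arms_R = [rng.random((K, S, A)) for _ in range(N)]
Qs = [single_arm_q(P, R, C, lam=0.1) for P, R in zip(arms_P, arms_R)]
actions = index_policy(Qs, states=[0, 1, 2, 0, 1], context=1, budget=2)
print(actions)
```

Each arm is solved separately, so the cost grows linearly in the number of arms, which is the kind of scalability that decomposition-based index policies aim for. The coupling between arms enters only through the price `lam` and the final top-`budget` selection.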
Taxonomy
Topics: Advanced Bandit Algorithms Research · Smart Grid Energy Management · Data Stream Mining Techniques
