Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh

TL;DR
This paper introduces a novel reinforcement learning approach using Whittle index and linear function approximation for wireless edge caching, effectively minimizing latency despite unknown system parameters.
Contribution
It develops a Q^{+}-Whittle-LFA algorithm that combines Whittle index policy with linear function approximation for scalable, model-free caching optimization.
Findings
Q^{+}-Whittle-LFA achieves low latency in simulations
The algorithm has a finite-time mean-square error bound
Empirical results show excellent performance with real data
Abstract
We consider the problem of content caching at the wireless edge to serve a set of end users via unreliable wireless channels so as to minimize the average latency experienced by end users due to the constrained wireless edge cache capacity. We formulate this problem as a Markov decision process, or more specifically a restless multi-armed bandit problem, which is provably hard to solve. We begin by investigating a discounted counterpart, and prove that it admits an optimal policy of the threshold-type. We then show that this result also holds for average latency problem. Using this structural result, we establish the indexability of our problem, and employ the Whittle index policy to minimize average latency. Since system parameters such as content request rates and wireless channel conditions are often unknown and time-varying, we further develop a model-free reinforcement learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Energy Harvesting in Wireless Networks · Optimization and Search Problems
