Whittle Index based Q-Learning for Wireless Edge Caching with Linear   Function Approximation

Guojun Xiong; Shufan Wang; Jian Li; Rahul Singh

arXiv:2202.13187·cs.NI·February 24, 2023

Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation

Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning approach using Whittle index and linear function approximation for wireless edge caching, effectively minimizing latency despite unknown system parameters.

Contribution

It develops a Q^{+}-Whittle-LFA algorithm that combines Whittle index policy with linear function approximation for scalable, model-free caching optimization.

Findings

01

Q^{+}-Whittle-LFA achieves low latency in simulations

02

The algorithm has a finite-time mean-square error bound

03

Empirical results show excellent performance with real data

Abstract

We consider the problem of content caching at the wireless edge to serve a set of end users via unreliable wireless channels so as to minimize the average latency experienced by end users due to the constrained wireless edge cache capacity. We formulate this problem as a Markov decision process, or more specifically a restless multi-armed bandit problem, which is provably hard to solve. We begin by investigating a discounted counterpart, and prove that it admits an optimal policy of the threshold-type. We then show that this result also holds for average latency problem. Using this structural result, we establish the indexability of our problem, and employ the Whittle index policy to minimize average latency. Since system parameters such as content request rates and wireless channel conditions are often unknown and time-varying, we further develop a model-free reinforcement learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCaching and Content Delivery · Energy Harvesting in Wireless Networks · Optimization and Search Problems