A Reinforcement-Learning Approach to Proactive Caching in Wireless Networks
Samuel O. Somuyiwa, Andras Gyorgy, Deniz Gunduz

TL;DR
This paper develops a reinforcement learning-based proactive caching strategy for wireless networks that minimizes energy costs by dynamically managing cache contents based on channel conditions and user behavior.
Contribution
It introduces a threshold-based caching scheme optimized via reinforcement learning, addressing the intractability of exact threshold computation in dynamic environments.
Findings
Proposed schemes outperform reactive downloading significantly.
Near-optimal performance close to genie-aided lower bounds.
Dynamic caching adapts effectively to channel and user behavior.
Abstract
We consider a mobile user accessing contents in a dynamic environment, where new contents are generated over time (by the user's contacts), and remain relevant to the user for random lifetimes. The user, equipped with a finite-capacity cache memory, randomly accesses the system, and requests all the relevant contents at the time of access. The system incurs an energy cost associated with the number of contents downloaded and the channel quality at that time. Assuming causal knowledge of the channel quality, the content profile, and the user-access behavior, we model the proactive caching problem as a Markov decision process with the goal of minimizing the long-term average energy cost. We first prove the optimality of a threshold-based proactive caching scheme, which dynamically caches or removes appropriate contents from the memory, prior to being requested by the user, depending on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
