Edge Caching Optimization with PPO and Transfer Learning for Dynamic Environments
Farnaz Niknia, Ping Wang

TL;DR
This paper introduces a PPO-based edge caching strategy that adapts to dynamic environments by incorporating key file attributes and transfer learning, significantly improving caching efficiency and response times.
Contribution
It proposes a novel transfer learning-enhanced PPO algorithm for edge caching that adapts to changing content popularity and request rates efficiently.
Findings
Outperforms recent DRL-based caching methods in simulations.
Effectively detects and adapts to changes in content popularity.
Accelerates convergence in new environments using transfer learning.
Abstract
This paper addresses the challenge of edge caching in dynamic environments, where rising traffic loads strain backhaul links and core networks. We propose a Proximal Policy Optimization (PPO)-based caching strategy that fully incorporates key file attributes such as size, lifetime, importance, and popularity, while also considering random file request arrivals, reflecting more realistic edge caching scenarios. In dynamic environments, changes such as shifts in content popularity and variations in request rates frequently occur, making previously learned policies less effective as they were optimized for earlier conditions. Without adaptation, caching efficiency and response times can degrade. While learning a new policy from scratch in a new environment is an option, it is highly inefficient and computationally expensive. Thus, adapting an existing policy to these changes is critical.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery
MethodsEntropy Regularization · Proximal Policy Optimization
