Learning Selective Merge Policies for Deadline-Constrained Coded Caching via Deep Reinforcement Learning
Amirhossein Yousefiramandi

TL;DR
This paper introduces a deep reinforcement learning approach to optimize selective message merging in deadline-constrained coded caching, significantly reducing packet expiration and improving efficiency.
Contribution
It formulates the deadline-constrained coded delivery as a masked discrete-action queue control problem and trains a graph-attention policy network for optimal merging decisions.
Findings
Reduced broadcast-packet expiration ratio by 40.9% compared to baseline
Achieved best broadcast-efficiency score among coded multi-casting methods
Learned to merge only 31.8% of the time for tight deadline users
Abstract
With the coded caching, the server can use the information the users have cached to serve multiple users at a time by sending a single coded multi-casting message, i.e., the merged message, thereby relieving the peak network loads. However, for the delay-sensitive applications of the users, like the video streaming services, it becomes essential to choose which messages to merge online, considering the strict deadlines for each request. The problem, however, is that while the merge is helpful for the formation of the current coded multi-casting message, it can be harmful for the subsequent ones. We proposed a DRL-based solution that formulates the deadline-constrained coded delivery as a masked discrete-action queue-state control problem, while we trained a graph-attention policy network via proximal policy optimization. The policy network reduces the broadcast-packet expiration ratio…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
