Learning Selective Merge Policies for Deadline-Constrained Coded Caching via Deep Reinforcement Learning

Amirhossein Yousefiramandi

arXiv:2605.15236·cs.IT·May 18, 2026

Learning Selective Merge Policies for Deadline-Constrained Coded Caching via Deep Reinforcement Learning

Amirhossein Yousefiramandi

PDF

TL;DR

This paper introduces a deep reinforcement learning approach to optimize selective message merging in deadline-constrained coded caching, significantly reducing packet expiration and improving efficiency.

Contribution

It formulates the deadline-constrained coded delivery as a masked discrete-action queue control problem and trains a graph-attention policy network for optimal merging decisions.

Findings

01

Reduced broadcast-packet expiration ratio by 40.9% compared to baseline

02

Achieved best broadcast-efficiency score among coded multi-casting methods

03

Learned to merge only 31.8% of the time for tight deadline users

Abstract

With the coded caching, the server can use the information the users have cached to serve multiple users at a time by sending a single coded multi-casting message, i.e., the merged message, thereby relieving the peak network loads. However, for the delay-sensitive applications of the users, like the video streaming services, it becomes essential to choose which messages to merge online, considering the strict deadlines for each request. The problem, however, is that while the merge is helpful for the formation of the current coded multi-casting message, it can be harmful for the subsequent ones. We proposed a DRL-based solution that formulates the deadline-constrained coded delivery as a masked discrete-action queue-state control problem, while we trained a graph-attention policy network via proximal policy optimization. The policy network reduces the broadcast-packet expiration ratio…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.