Deep Reinforcement Learning for Dynamic Spectrum Sharing of LTE and NR
Ursula Challita, David Sandberg

TL;DR
This paper introduces a deep reinforcement learning-based controller for proactive dynamic spectrum sharing between 4G LTE and 5G NR, optimizing resource allocation by predicting future network states to improve overall system performance.
Contribution
It presents a novel deep RL architecture using Monte Carlo Tree Search for proactive spectrum management in LTE and NR systems, considering future network conditions.
Findings
Improves spectrum sharing efficiency over greedy methods.
Accounts for future network states in resource allocation.
Enhances overall system-level performance.
Abstract
In this paper, a proactive dynamic spectrum sharing scheme between 4G and 5G systems is proposed. In particular, a controller decides on the resource split between NR and LTE every subframe while accounting for future network states such as high interference subframes and multimedia broadcast single frequency network (MBSFN) subframes. To solve this problem, a deep reinforcement learning (RL) algorithm based on Monte Carlo Tree Search (MCTS) is proposed. The introduced deep RL architecture is trained offline whereby the controller predicts a sequence of future states of the wireless access network by simulating hypothetical bandwidth splits over time starting from the current network state. The action sequence resulting in the best reward is then assigned. This is realized by predicting the quantities most directly relevant to planning, i.e., the reward, the action probabilities, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
