Online Frequency Scheduling by Learning Parallel Actions

Anastasios Giovanidis; Mathieu Leconte; Sabrine Aroua; Tor Kvernvik,; David Sandberg

arXiv:2406.05041·cs.NI·December 18, 2024

Online Frequency Scheduling by Learning Parallel Actions

Anastasios Giovanidis, Mathieu Leconte, Sabrine Aroua, Tor Kvernvik,, David Sandberg

PDF

Open Access

TL;DR

This paper introduces a novel deep reinforcement learning-based frequency scheduler for multi-user MIMO systems in 6G networks, enabling efficient, scalable, and adaptable resource allocation with parallel decision-making.

Contribution

It proposes a new action-branching deep Q-learning architecture with variations like Unibranch and GNN-based models for scalable, real-time frequency scheduling.

Findings

01

Achieves competitive performance against existing methods.

02

Enables online fine-tuning to adapt to changing environments.

03

Reduces model complexity with proposed variations.

Abstract

Radio Resource Management is a challenging topic in future 6G networks where novel applications create strong competition among the users for the available resources. In this work we consider the frequency scheduling problem in a multi-user MIMO system. Frequency resources need to be assigned to a set of users while allowing for concurrent transmissions in the same sub-band. Traditional methods are insufficient to cope with all the involved constraints and uncertainties, whereas reinforcement learning can directly learn near-optimal solutions for such complex environments. However, the scheduling problem has an enormous action space accounting for all the combinations of users and sub-bands, so out-of-the-box algorithms cannot be used directly. In this work, we propose a scheduler based on action-branching over sub-bands, which is a deep Q-learning architecture with parallel decision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Wireless Network Optimization · Scheduling and Timetabling Solutions · Scheduling and Optimization Algorithms

MethodsSparse Evolutionary Training · Q-Learning