Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation

Cengis Hasan; Alexandros Agapitos; David Lynch; Alberto Castagna; Giorgio Cruciata; Hao Wang; and Aleksandar Milenovic

arXiv:2404.19462·cs.LG·May 1, 2024

Open Access

TL;DR

This paper introduces a continual reinforcement learning approach for wireless network optimization that, in simulation, halves end-to-end deployment lead-time relative to a reinitialise-and-retrain baseline with no loss in optimization gain.

Contribution

It proposes a continual RL method that uses a sequence of overlapping action spaces (subsets of cell-level configuration parameters supplied by domain experts) to adapt control policies to new network sites.

Findings

01 End-to-end deployment lead-time halved compared to a reinitialise-and-retrain baseline

02 Optimization gain maintained, with no performance drop

03 Results obtained in simulation of wireless network throughput optimization

Abstract

We present a method that addresses the long lead-time required to deploy cell-level parameter optimisation policies to new wireless network sites. Given a sequence of action spaces represented by overlapping subsets of cell-level configuration parameters provided by domain experts, we formulate throughput optimisation as Continual Reinforcement Learning of control policies. Simulation results suggest that the proposed system is able to shorten the end-to-end deployment lead-time by a factor of two compared to a reinitialise-and-retrain baseline without any drop in optimisation gain.
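
The contrast between continual learning and the reinitialise-and-retrain baseline can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not describe the model or policy representation, so the sketch assumes one scalar weight per configuration parameter, and the parameter names (`tilt`, `tx_power`, `handover_offset`) and both helper functions are hypothetical. It only shows why overlapping action spaces matter: whatever was learned for shared parameters can be carried into the next stage, whereas the baseline discards it.

```python
from typing import Dict, Set


def warm_start(prev_weights: Dict[str, float],
               next_space: Set[str],
               init: float = 0.0) -> Dict[str, float]:
    """Continual variant (assumed): reuse weights for parameters that also
    appear in the next action space; initialise only the new ones."""
    return {p: prev_weights.get(p, init) for p in sorted(next_space)}


def reinitialise(next_space: Set[str], init: float = 0.0) -> Dict[str, float]:
    """Baseline variant: discard everything and start every parameter fresh."""
    return {p: init for p in sorted(next_space)}


# Hypothetical sequence of overlapping action spaces provided by an expert.
action_spaces = [
    {"tilt", "tx_power"},
    {"tx_power", "handover_offset"},
]

# Pretend these weights were learned while optimising the first action space.
stage1_weights = {"tilt": 0.4, "tx_power": -0.2}

continual = warm_start(stage1_weights, action_spaces[1])
baseline = reinitialise(action_spaces[1])
```

In this toy setting `continual` keeps the learned value for `tx_power`, the parameter shared by both action spaces, while `baseline` resets it, so the baseline has to relearn it from scratch.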

Taxonomy

Topics: Wireless Networks and Protocols · Advanced MIMO Systems Optimization