Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning
Lam Dinh, Pham Tran Anh Quang, J\'er\'emie Leguay

TL;DR
This paper presents a safe load balancing algorithm for SD-WAN that combines Deep Reinforcement Learning with Control Barrier Functions to ensure safety and improve training efficiency, achieving near-optimal QoS.
Contribution
The paper introduces a novel safe learning-based load balancing method integrating CBF with DRL, enhancing safety and training speed in SD-WAN environments.
Findings
The approach achieves approximately 110x faster training on GPU.
It delivers near-optimal QoS in terms of end-to-end delay.
On-policy PPO outperforms off-policy DDPG when combined with CBF.
Abstract
Deep Reinforcement Learning (DRL) algorithms have recently made significant strides in improving network performance. Nonetheless, their practical use is still limited in the absence of safe exploration and safe decision-making. In the context of commercial solutions, reliable and safe-to-operate systems are of paramount importance. Taking this problem into account, we propose a safe learning-based load balancing algorithm for Software Defined-Wide Area Network (SD-WAN), which is empowered by Deep Reinforcement Learning (DRL) combined with a Control Barrier Function (CBF). It safely projects unsafe actions into feasible ones during both training and testing, and it guides learning towards safe policies. We successfully implemented the solution on GPU to accelerate training by approximately 110x times and achieve model updates for on-policy methods within a few seconds, making the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced Photonic Communication Systems · Advanced Optical Network Technologies
