Management and Orchestration of Virtual Network Functions via Deep Reinforcement Learning
Joan S Pujol Roig, David M. Gutierrez-Estevez, Deniz G\"und\"uz

TL;DR
This paper introduces a deep reinforcement learning-based algorithm for managing and orchestrating virtual network functions in 5G networks, aiming to optimize resource allocation, reduce costs, and improve quality of service.
Contribution
It presents a novel DRL approach called parameterized action twin (PAT) deterministic policy gradient for autonomous resource management of VNFs in 5G networks.
Findings
The proposed DRL method effectively minimizes costs and enhances QoS.
Numerical results demonstrate alignment with 5G KPIs.
First application of DRL to VNF resource management in 5G.
Abstract
Management and orchestration (MANO) of resources by virtual network functions (VNFs) represents one of the key challenges towards a fully virtualized network architecture as envisaged by 5G standards. Current threshold-based policies inefficiently over-provision network resources and under-utilize available hardware, incurring high cost for network operators, and consequently, the users. In this work, we present a MANO algorithm for VNFs allowing a central unit (CU) to learn to autonomously re-configure resources (processing power and storage), deploy new VNF instances, or offload them to the cloud, depending on the network conditions, available pool of resources, and the VNF requirements, with the goal of minimizing a cost function that takes into account the economical cost as well as latency and the quality-of-service (QoS) experienced by the users. First, we formulate the stochastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
