V2N Service Scaling with Deep Reinforcement Learning
Cyril Shih-Huan Hsu, Jorge Mart\'in-P\'erez, Chrysa Papagianni, Paola Grosso

TL;DR
This paper presents a DRL-based auto-scaling method for edge computing in 5G vehicular networks, improving efficiency and performance through a novel discretization of DDPG.
Contribution
It introduces a discretization approach for DDPG to handle discrete scaling actions in edge computing, demonstrating superior performance with real vehicular data.
Findings
Reduces active CPU usage by at least 23%.
Increases long-term reward by 24%.
Outperforms existing auto-scaling solutions.
Abstract
The fifth generation (5G) of wireless networks is set out to meet the stringent requirements of vehicular use cases. Edge computing resources can aid in this direction by moving processing closer to end-users, reducing latency. However, given the stochastic nature of traffic loads and availability of physical resources, appropriate auto-scaling mechanisms need to be employed to support cost-efficient and performant services. To this end, we employ Deep Reinforcement Learning (DRL) for vertical scaling in Edge computing to support vehicular-to-network communications. We address the problem using Deep Deterministic Policy Gradient (DDPG). As DDPG is a model-free off-policy algorithm for learning continuous actions, we introduce a discretization approach to support discrete scaling actions. Thus we address scalability problems inherent to high-dimensional discrete action spaces. Employing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Vehicular Ad Hoc Networks (VANETs) · Privacy-Preserving Technologies in Data
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Batch Normalization · Adam · Weight Decay · Experience Replay · Dense Connections · Deep Deterministic Policy Gradient
