Scaling Serverless Functions in Edge Networks: A Reinforcement Learning Approach
Mounir Bensalem, Erkan Ipek, Admela Jukan

TL;DR
This paper presents a reinforcement learning-based auto-scaling solution for serverless functions in edge networks, significantly reducing delay and improving resource allocation for delay-sensitive applications.
Contribution
It introduces a novel RL approach for auto-scaling in edge serverless environments, outperforming traditional heuristics in delay reduction.
Findings
RL outperforms heuristics in delay metrics
Delay reduction of up to 50% achieved
Effective resource allocation in distributed edge networks
Abstract
With rapid advances in containerization techniques, the serverless computing model is becoming a valid candidate execution model in edge networking, similar to the widely used cloud model for applications that are stateless, single purpose and event-driven, and in particular for delay-sensitive applications. One of the cloud serverless processes, i.e., the auto-scaling mechanism, cannot be however directly applied at the edge, due to the distributed nature of edge nodes, the difficulty of optimal resource allocation, and the delay sensitivity of workloads. We propose a solution to the auto-scaling problem by applying reinforcement learning (RL) approach to solving problem of efficient scaling and resource allocation of serverless functions in edge networks. We compare RL and Deep RL algorithms with empirical, monitoring-based heuristics, considering delay-sensitive applications. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Blockchain Technology Applications and Security · Cloud Computing and Resource Management
