Learning Augmented Index Policy for Optimal Service Placement at the Network Edge
Guojun Xiong, Rahul Singh, Jian Li

TL;DR
This paper develops learning-augmented algorithms for optimal service placement at the network edge, leveraging Whittle indices and addressing unknown, time-varying request rates to minimize latency.
Contribution
It derives explicit Whittle indices for single-service MDPs and introduces two novel algorithms, UCB-Whittle and Q-learning-Whittle, with theoretical performance guarantees.
Findings
Algorithms achieve low regret in learning request rates.
Proposed policies outperform baseline methods in simulations.
The approach effectively balances exploration and exploitation.
Abstract
We consider the problem of service placement at the network edge, in which a decision maker has to choose between services to host at the edge to satisfy the demands of customers. Our goal is to design adaptive algorithms to minimize the average service delivery latency for customers. We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for each service, the number of customers that are currently waiting at the edge to obtain the service. However, solving this -services MDP is computationally expensive due to the curse of dimensionality. To overcome this challenge, we show that the optimal policy for a single-service MDP has an appealing threshold structure, and derive explicitly the Whittle indices for each service as a function of the number of requests from customers based on the theory of Whittle index policy. Since…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Advanced Bandit Algorithms Research · Advanced Queuing Theory Analysis
Methodstravel james · Q-Learning
