Learning Augmented Index Policy for Optimal Service Placement at the   Network Edge

Guojun Xiong; Rahul Singh; Jian Li

arXiv:2101.03641·cs.NI·January 15, 2021·6 cites

Learning Augmented Index Policy for Optimal Service Placement at the Network Edge

Guojun Xiong, Rahul Singh, Jian Li

PDF

Open Access

TL;DR

This paper develops learning-augmented algorithms for optimal service placement at the network edge, leveraging Whittle indices and addressing unknown, time-varying request rates to minimize latency.

Contribution

It derives explicit Whittle indices for single-service MDPs and introduces two novel algorithms, UCB-Whittle and Q-learning-Whittle, with theoretical performance guarantees.

Findings

01

Algorithms achieve low regret in learning request rates.

02

Proposed policies outperform baseline methods in simulations.

03

The approach effectively balances exploration and exploitation.

Abstract

We consider the problem of service placement at the network edge, in which a decision maker has to choose between $N$ services to host at the edge to satisfy the demands of customers. Our goal is to design adaptive algorithms to minimize the average service delivery latency for customers. We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for each service, the number of customers that are currently waiting at the edge to obtain the service. However, solving this $N$ -services MDP is computationally expensive due to the curse of dimensionality. To overcome this challenge, we show that the optimal policy for a single-service MDP has an appealing threshold structure, and derive explicitly the Whittle indices for each service as a function of the number of requests from customers based on the theory of Whittle index policy. Since…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Advanced Bandit Algorithms Research · Advanced Queuing Theory Analysis

Methodstravel james · Q-Learning