A unifying computations of Whittle's Index for Markovian bandits

Urtzi Ayesta; Manu K. Gupta; and Ina Maria Verloop

arXiv:1906.10946·math.OC·June 27, 2019·1 cites

A unifying computations of Whittle's Index for Markovian bandits

Urtzi Ayesta, Manu K. Gupta, and Ina Maria Verloop

PDF

Open Access

TL;DR

This paper derives a unified analytical expression for Whittle's index applicable to any Markovian bandit, simplifying computation and broadening its applicability across various decision-making problems.

Contribution

It provides the first general analytical formula for Whittle's index for Markovian bandits with finite and infinite transition rates, including conditions for indexability.

Findings

01

Derived a unifying expression for Whittle's index

02

Established conditions for threshold-type solutions and indexability

03

Validated the approach with examples from machine repair and communication networks

Abstract

The multi-armed restless bandit framework allows to model a wide variety of decision-making problems in areas as diverse as industrial engineering, computer communication, operations research, financial engineering, communication networks etc. In a seminal work, Whittle developed a methodology to derive well-performing (Whittle's) index policies that are obtained by solving a relaxed version of the original problem. However, the computation of Whittle's index itself is a difficult problem and hence researchers focused on calculating Whittle's index numerically or with a problem dependent approach. In our main contribution we derive an analytical expression for Whittle's index for any Markovian bandit with both finite and infinite transition rates. We derive sufficient conditions for the optimal solution of the relaxed problem to be of threshold type, and obtain conditions for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Supply Chain and Inventory Management