A fast-pivoting algorithm for Whittle's restless bandit index

Jos\'e Ni\~no-Mora

arXiv:2601.14819·math.OC·January 22, 2026

A fast-pivoting algorithm for Whittle's restless bandit index

Jos\'e Ni\~no-Mora

PDF

Open Access

TL;DR

This paper introduces a novel fast-pivoting algorithm that efficiently computes Whittle's index for large-state restless bandits, significantly reducing computational complexity and runtime compared to existing methods.

Contribution

The paper presents a new algorithm leveraging parametric simplex methods and pattern analysis to compute Whittle's index with improved speed for large state spaces.

Findings

01

Substantial runtime speed-ups demonstrated in numerical experiments

02

Algorithm reduces complexity to approximately (2/3) n^3 operations after initialization

03

Effective for large-state restless bandit problems

Abstract

The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and which can change state while passive. It further provides a practical heuristic priority-index policy for the computationally intractable multi-armed restless bandit problem, which has been widely applied over the last three decades in multifarious settings, yet mostly restricted to project models with a one-dimensional state. This is due in part to the difficulty of establishing indexability (existence of the index) and of computing the index for projects with large state spaces. This paper draws on the author's prior results on sufficient indexability conditions and an adaptive-greedy algorithmic scheme for restless bandits to obtain a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Auction Theory and Applications