Lévy bandits under Poissonian decision times

José-Luis Pérez; Kazutoshi Yamazaki

arXiv:2301.07798 · math.PR · January 20, 2023

TL;DR

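This paper studies a continuous-time multi-armed bandit problem in which decisions can be made only at the arrival times of a Poisson process. For arms driven by spectrally one-sided Lévy processes, it derives an explicit expression for the Gittins index in terms of the scale function and shows that this index converges to that of the classical Lévy bandit of Kaspi and Mandelbaum (1995).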

Contribution

It provides an explicit Gittins index formula, written in terms of the scale function, for Lévy bandits with Poisson decision times, and shows that the index converges to the one in the classical Lévy bandit of Kaspi and Mandelbaum (1995).

Findings

01. Explicit Gittins index in terms of scale functions

02. Convergence to classical Lévy bandit results

03. Applicable to spectrally one-sided Lévy processes

Abstract

We consider a version of the continuous-time multi-armed bandit problem where decision opportunities arrive at Poisson arrival times, and study its Gittins index policy. When driven by spectrally one-sided Lévy processes, the Gittins index can be written explicitly in terms of the scale function, and is shown to converge to that in the classical Lévy bandit of Kaspi and Mandelbaum (1995).
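
To make the phrase "decision opportunities arrive at Poisson arrival times" concrete, the following is a minimal sketch that samples such decision times on a finite horizon by summing independent exponential gaps. It is an illustration only, not the paper's method: it does not compute a Gittins index or simulate a Lévy process, and the names `poisson_decision_times`, `rate`, `horizon`, and `seed` are hypothetical, chosen for this example.

```python
import random


def poisson_decision_times(rate, horizon, seed=None):
    """Sample the arrival times of a Poisson process on [0, horizon].

    Illustrative assumption: `rate` is the intensity of the Poisson
    process and `horizon` is the length of the observation window.
    Neither name comes from the paper.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    if horizon < 0:
        raise ValueError("horizon must be non-negative")

    rng = random.Random(seed)
    times = []
    # Inter-arrival gaps of a Poisson process are i.i.d. exponential
    # with mean 1 / rate, so arrival times are their running sums.
    t = rng.expovariate(rate)
    while t <= horizon:
        times.append(t)
        t += rng.expovariate(rate)
    return times


# Example: decision opportunities at rate 2 per unit time on [0, 10].
decision_times = poisson_decision_times(rate=2.0, horizon=10.0, seed=0)
print(len(decision_times), "decision opportunities")
```

In this sketch the controller would be allowed to switch arms only at the instants in `decision_times`. A larger `rate` makes those instants denser on average.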

Taxonomy

Topics: Advanced Bandit Algorithms Research · Age of Information Optimization · Supply Chain and Inventory Management