L\'evy bandits under Poissonian decision times
Jos\'e-Luis P\'erez, Kazutoshi Yamazaki

TL;DR
This paper analyzes a continuous-time multi-armed bandit problem with Poisson decision times, deriving explicit Gittins index expressions for spectrally one-sided Lévy processes and demonstrating convergence to classical Lévy bandit results.
Contribution
It provides explicit Gittins index formulas for Lévy bandits with Poisson decision times and establishes their convergence to classical models.
Findings
Explicit Gittins index in terms of scale functions
Convergence to classical Lévy bandit results
Applicable to spectrally one-sided Lévy processes
Abstract
We consider a version of the continuous-time multi-armed bandit problem where decision opportunities arrive at Poisson arrival times, and study its Gittins index policy. When driven by spectrally one-sided L\'evy processes, the Gittins index can be written explicitly in terms of the scale function, and is shown to converge to that in the classical L\'evy bandit of Kaspi and Mandelbaum (1995).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Age of Information Optimization · Supply Chain and Inventory Management
