Quantum Algorithms for Finite-horizon Markov Decision Processes

Bin Luo; Yuwen Huang; Jonathan Allcock; Xiaojun Lin; Shengyu Zhang; John C.S. Lui

arXiv:2508.05712 · quant-ph · August 11, 2025 · ICML

TL;DR

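This paper introduces quantum algorithms for time-dependent, finite-horizon Markov Decision Processes. They achieve a quadratic speedup in the size of the action space over classical value iteration, an additional speedup in the size of the state space when only near-optimal policies are required, and asymptotically optimal sample complexity.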

Contribution

The paper presents quantum algorithms for finite-horizon MDPs in two settings, exact dynamics and a generative model, that outperform classical methods, with proven speedups and optimal bounds.

Findings

01. Quadratic speedup in the size of the action space over classical value iteration

02. Additional speedup in the size of the state space when computing near-optimal policies

03. Quantum algorithms achieve asymptotically optimal sample complexity

Abstract

In this work, we design quantum algorithms that are more efficient than classical algorithms to solve time-dependent and finite-horizon Markov Decision Processes (MDPs) in two distinct settings: (1) In the exact dynamics setting, where the agent has full knowledge of the environment's dynamics (i.e., transition probabilities), we prove that our Quantum Value Iteration (QVI) algorithm QVI-1 achieves a quadratic speedup in the size of the action space ($A$) compared with the classical value iteration algorithm for computing the optimal policy ($\pi^{*}$) and the optimal V-value function ($V_{0}^{*}$). Furthermore, our algorithm QVI-2 provides an additional speedup in the size of the state space ($S$) when obtaining near-optimal policies and V-value functions. Both QVI-1 and QVI-2 achieve quantum query complexities that provably…
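For orientation, the classical baseline that the abstract compares against can be sketched as backward induction over a time-dependent, finite-horizon MDP. This is a minimal illustration of classical value iteration only, not the paper's QVI-1 or QVI-2. The function name, the nested-list layout of `P` and `r`, and the toy two-state MDP are all assumptions made for this example. The inner loop scans every action for each state and time step, so its cost grows linearly in $A$; that is the dependence the abstract says QVI-1 improves quadratically.

```python
from typing import List, Tuple

def finite_horizon_value_iteration(
    P: List[List[List[List[float]]]],  # P[h][s][a][t]: prob. of moving s -> t under a at step h
    r: List[List[List[float]]],        # r[h][s][a]: reward for taking a in s at step h
    H: int,                            # horizon length
) -> Tuple[List[List[float]], List[List[int]]]:
    """Classical backward induction. Returns (V, pi) with V[h][s] and pi[h][s]."""
    S = len(P[0])
    A = len(P[0][0])
    V = [[0.0] * S for _ in range(H + 1)]  # V[H] is the terminal value, fixed to 0
    pi = [[0] * S for _ in range(H)]
    for h in reversed(range(H)):
        for s in range(S):
            best_a, best_q = 0, float("-inf")
            # Linear scan over all A actions: the classical cost per (h, s) pair.
            for a in range(A):
                q = r[h][s][a] + sum(P[h][s][a][t] * V[h + 1][t] for t in range(S))
                if q > best_q:
                    best_a, best_q = a, q
            V[h][s] = best_q
            pi[h][s] = best_a
    return V, pi

# Hypothetical toy MDP: 2 states, 2 actions, horizon 2, same dynamics at every step.
step_P = [
    [[1.0, 0.0], [0.0, 1.0]],  # state 0: action 0 stays, action 1 moves to state 1
    [[0.0, 1.0], [1.0, 0.0]],  # state 1: action 0 stays, action 1 moves to state 0
]
step_r = [
    [0.0, 0.5],  # rewards in state 0
    [1.0, 0.0],  # rewards in state 1
]
H = 2
V, pi = finite_horizon_value_iteration([step_P] * H, [step_r] * H, H)
print(V[0], pi[0])
```

In this toy instance the optimal first move from state 0 is to switch to state 1 and then stay there, which the returned policy reflects. Time dependence is supported by passing a different transition table and reward table for each step `h`.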

Taxonomy

Topics: Quantum Computing Algorithms and Architecture