Accelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach

Yang Xu; Vaneet Aggarwal

arXiv:2501.16243·quant-ph·July 2, 2025

Accelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach

Yang Xu, Vaneet Aggarwal

PDF

Open Access

TL;DR

This paper introduces a Quantum Natural Policy Gradient algorithm that accelerates quantum reinforcement learning by reducing sample complexity and integrating deterministic gradient estimation into quantum systems.

Contribution

The paper proposes a novel QNPG algorithm that replaces stochastic sampling with deterministic estimation, improving efficiency in quantum reinforcement learning.

Findings

01

Achieves a sample complexity of ~O(ε^{-1.5}) for quantum oracle queries.

02

Significantly outperforms classical lower bounds of ~O(ε^{-2}).

03

Demonstrates effective integration of deterministic gradient estimation in quantum settings.

Abstract

We address the problem of quantum reinforcement learning (QRL) under model-free settings with quantum oracle access to the Markov Decision Process (MDP). This paper introduces a Quantum Natural Policy Gradient (QNPG) algorithm, which replaces the random sampling used in classical Natural Policy Gradient (NPG) estimators with a deterministic gradient estimation approach, enabling seamless integration into quantum systems. While this modification introduces a bounded bias in the estimator, the bias decays exponentially with increasing truncation levels. This paper demonstrates that the proposed QNPG algorithm achieves a sample complexity of $\tilde{O} (ϵ^{- 1.5})$ for queries to the quantum oracle, significantly improving the classical lower bound of $\tilde{O} (ϵ^{- 2})$ for queries to the MDP.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Computing Algorithms and Architecture · Quantum Information and Cryptography