QuantFPFlow: Quantum Amplitude Estimation for Fokker--Planck Policy Optimisation in Continuous Reinforcement Learning

Abraham Itzhak Weinberg

arXiv:2605.16429·cs.LG·May 19, 2026

TL;DR

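QuantFPFlow is a quantum-inspired reinforcement learning framework that uses amplitude estimation to estimate the Fokker--Planck partition function at $\mathcal{O}(1/\varepsilon)$ cost instead of the classical $\mathcal{O}(1/\varepsilon^{2})$, a quadratic speedup, and uses the estimated stationary distribution to improve exploration in continuous control tasks.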

Contribution

It presents a novel quantum-inspired method integrating amplitude estimation into Fokker--Planck RL, enhancing efficiency and exploration in continuous reinforcement learning.

Findings

01  Achieves a quadratic speedup in estimating the FP partition function.

02  Outperforms Soft Actor-Critic on a multimodal reward task.

03  Scales more efficiently with dimensionality than classical methods.

Abstract

We introduce QuantFPFlow, a reinforcement learning framework that integrates quantum amplitude estimation into the Fokker--Planck (FP) formulation of stochastic policy optimisation. Classical continuous-space RL agents must estimate the FP partition function $Z = \int e^{-V(x)/D}\,dx$ at cost $\mathcal{O}(1/\varepsilon^{2})$; QuantFPFlow replaces this with a Grover-amplified amplitude estimator achieving $\mathcal{O}(1/\varepsilon)$, a provable quadratic speedup. While the full quantum acceleration requires fault-tolerant hardware, the quantum-inspired classical simulation demonstrated here already exhibits the $\mathcal{O}(1/\varepsilon)$ algorithmic structure. The estimated stationary distribution $\rho^{*}$ drives a theoretically grounded exploration bonus $R_{\mathrm{aug}} = R_{\mathrm{env}} + \alpha \log(1/\rho^{*}(s))$. This bonus steers the agent toward globally optimal regions of…
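To make the quantities in the abstract concrete, the following is a minimal Python sketch of the classical baseline only: a plain Monte Carlo estimate of $Z$, whose standard error shrinks as $1/\sqrt{N}$ and therefore needs $\mathcal{O}(1/\varepsilon^{2})$ samples for accuracy $\varepsilon$. It does not implement the paper's amplitude estimator. Everything specific here is an assumption for illustration: the 1-D double-well `potential`, the truncation of the integral to `[low, high]`, the Gibbs form $\rho^{*}(x) = e^{-V(x)/D}/Z$ for the stationary distribution (the form implied by the definition of $Z$), and all function names and parameter values.

```python
import numpy as np


def potential(x):
    # Hypothetical 1-D double-well potential V(x), chosen only for illustration.
    return (x ** 2 - 1.0) ** 2


def estimate_partition_mc(potential_fn, diffusion, low, high, n_samples, rng):
    # Plain Monte Carlo estimate of Z = integral of exp(-V(x)/D) over [low, high].
    # Uniform samples; the standard error decays like 1/sqrt(n_samples).
    x = rng.uniform(low, high, size=n_samples)
    return (high - low) * np.mean(np.exp(-potential_fn(x) / diffusion))


def stationary_density(x, potential_fn, diffusion, z):
    # Assumed Gibbs form of the stationary distribution: rho*(x) = exp(-V(x)/D) / Z.
    return np.exp(-potential_fn(x) / diffusion) / z


def augmented_reward(r_env, rho_star, alpha, floor=1e-12):
    # R_aug = R_env + alpha * log(1 / rho*(s)); the floor avoids log of zero.
    return r_env + alpha * np.log(1.0 / np.maximum(rho_star, floor))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    D = 0.5  # assumed diffusion coefficient
    z_hat = estimate_partition_mc(potential, D, -3.0, 3.0, 100_000, rng)
    states = np.array([0.0, 1.0])  # barrier top and a well minimum
    rho = stationary_density(states, potential, D, z_hat)
    print("Z estimate:", z_hat)
    print("augmented rewards:", augmented_reward(0.0, rho, alpha=0.1))
```

In this sketch the state at the barrier top (`x = 0`) has lower stationary density than the well minimum (`x = 1`), so it receives the larger bonus, which is the sense in which the $\log(1/\rho^{*}(s))$ term rewards visiting low-density states.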
