Towards Real-time Control of a CartPole System on a Quantum Computer

Nguyen Truong Thu Ngo; V\"ain\"o Mehtola; J\'erome Lenssen; Peiyong Wang; Francesco Cosco; Tien-Fu Lu; James Q. Quach

arXiv:2605.01716·quant-ph·May 5, 2026

Towards Real-time Control of a CartPole System on a Quantum Computer

Nguyen Truong Thu Ngo, V\"ain\"o Mehtola, J\'erome Lenssen, Peiyong Wang, Francesco Cosco, Tien-Fu Lu, James Q. Quach

PDF

TL;DR

This paper demonstrates a minimal hybrid quantum-classical approach to real-time CartPole control on a superconducting quantum processor, highlighting the impact of inference frequency and shot count on stability and performance.

Contribution

It presents the first end-to-end implementation of a quantum reinforcement learning agent on a physical QPU for real-time control, analyzing hardware latency and control trade-offs.

Findings

01

Single-qubit quantum agent outperforms classical in fewer episodes.

02

Higher inference frequency improves control stability.

03

Increased shot count reduces minimum frequency needed for near-maximal performance.

Abstract

The application of quantum reinforcement learning (QRL) to real-time control systems faces significant challenges regarding hardware latency, noise susceptibility, and learning convergence. This work presents an end-to-end investigation of a minimal hybrid quantum-classical agent applied to the CartPole benchmark, addressing the gap between idealized simulation and execution on a physical superconducting quantum processing unit (QPU). We demonstrate that a single-qubit agent acts as an effective learning model, solving the environment in substantially fewer episodes than a comparable classical actor-critic network even when the training of the hybrid agent is restricted to use parameter-shift for its quantum circuit component. To connect learning to deployment constraints, we map the inference-time trade-off between control-loop rate and measurement shot budget to provide guidance for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.