Towards Real-time Control of a CartPole System on a Quantum Computer
Nguyen Truong Thu Ngo, V\"ain\"o Mehtola, J\'erome Lenssen, Peiyong Wang, Francesco Cosco, Tien-Fu Lu, James Q. Quach

TL;DR
This paper demonstrates a minimal hybrid quantum-classical approach to real-time CartPole control on a superconducting quantum processor, highlighting the impact of inference frequency and shot count on stability and performance.
Contribution
It presents the first end-to-end implementation of a quantum reinforcement learning agent on a physical QPU for real-time control, analyzing hardware latency and control trade-offs.
Findings
Single-qubit quantum agent outperforms classical in fewer episodes.
Higher inference frequency improves control stability.
Increased shot count reduces minimum frequency needed for near-maximal performance.
Abstract
The application of quantum reinforcement learning (QRL) to real-time control systems faces significant challenges regarding hardware latency, noise susceptibility, and learning convergence. This work presents an end-to-end investigation of a minimal hybrid quantum-classical agent applied to the CartPole benchmark, addressing the gap between idealized simulation and execution on a physical superconducting quantum processing unit (QPU). We demonstrate that a single-qubit agent acts as an effective learning model, solving the environment in substantially fewer episodes than a comparable classical actor-critic network even when the training of the hybrid agent is restricted to use parameter-shift for its quantum circuit component. To connect learning to deployment constraints, we map the inference-time trade-off between control-loop rate and measurement shot budget to provide guidance for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
