Realizing a deep reinforcement learning agent discovering real-time feedback control strategies for a quantum system
Kevin Reuer, Jonas Landgraf, Thomas F\"osel, James O'Sullivan, Liberto, Beltr\'an, Abdulkadir Akin, Graham J. Norris, Ants Remm, Michael Kerschbaum,, Jean-Claude Besse, Florian Marquardt, Andreas Wallraff, Christopher Eichler

TL;DR
This paper presents a latency-optimized deep reinforcement learning agent implemented on FPGA for real-time feedback control of superconducting qubits, enabling efficient state initialization without detailed device models.
Contribution
The authors develop and demonstrate a deep reinforcement learning agent on FPGA capable of real-time quantum feedback control, a challenge previously unaddressed.
Findings
Agent successfully initializes superconducting qubits into target states
Reinforcement learning outperforms threshold-based strategies
Effective across different measurement types and readout levels
Abstract
To realize the full potential of quantum technologies, finding good strategies to control quantum information processing devices in real time becomes increasingly important. Usually these strategies require a precise understanding of the device itself, which is generally not available. Model-free reinforcement learning circumvents this need by discovering control strategies from scratch without relying on an accurate description of the quantum system. Furthermore, important tasks like state preparation, gate teleportation and error correction need feedback at time scales much shorter than the coherence time, which for superconducting circuits is in the microsecond range. Developing and training a deep reinforcement learning agent able to operate in this real-time feedback regime has been an open challenge. Here, we have implemented such an agent in the form of a latency-optimized deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
