Realizing a deep reinforcement learning agent discovering real-time   feedback control strategies for a quantum system

Kevin Reuer; Jonas Landgraf; Thomas F\"osel; James O'Sullivan; Liberto; Beltr\'an; Abdulkadir Akin; Graham J. Norris; Ants Remm; Michael Kerschbaum,; Jean-Claude Besse; Florian Marquardt; Andreas Wallraff; Christopher Eichler

arXiv:2210.16715·quant-ph·November 9, 2023

Realizing a deep reinforcement learning agent discovering real-time feedback control strategies for a quantum system

Kevin Reuer, Jonas Landgraf, Thomas F\"osel, James O'Sullivan, Liberto, Beltr\'an, Abdulkadir Akin, Graham J. Norris, Ants Remm, Michael Kerschbaum,, Jean-Claude Besse, Florian Marquardt, Andreas Wallraff, Christopher Eichler

PDF

TL;DR

This paper presents a latency-optimized deep reinforcement learning agent implemented on FPGA for real-time feedback control of superconducting qubits, enabling efficient state initialization without detailed device models.

Contribution

The authors develop and demonstrate a deep reinforcement learning agent on FPGA capable of real-time quantum feedback control, a challenge previously unaddressed.

Findings

01

Agent successfully initializes superconducting qubits into target states

02

Reinforcement learning outperforms threshold-based strategies

03

Effective across different measurement types and readout levels

Abstract

To realize the full potential of quantum technologies, finding good strategies to control quantum information processing devices in real time becomes increasingly important. Usually these strategies require a precise understanding of the device itself, which is generally not available. Model-free reinforcement learning circumvents this need by discovering control strategies from scratch without relying on an accurate description of the quantum system. Furthermore, important tasks like state preparation, gate teleportation and error correction need feedback at time scales much shorter than the coherence time, which for superconducting circuits is in the microsecond range. Developing and training a deep reinforcement learning agent able to operate in this real-time feedback regime has been an open challenge. Here, we have implemented such an agent in the form of a latency-optimized deep…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.