Decoding surface codes with deep reinforcement learning and probabilistic policy reuse
Elisha Siddiqui Matekole, Esther Ye, Ramya Iyer, and Samuel Yen-Chi, Chen

TL;DR
This paper introduces a continual reinforcement learning approach using DDQN-PPR to improve quantum surface code decoding, effectively adapting to changing noise patterns and reducing computational complexity.
Contribution
It presents a novel continual RL method with probabilistic policy reuse for surface code decoding in quantum computing, enhancing adaptability and efficiency.
Findings
Significant reduction in computational complexity.
Improved decoding performance with more trained policies.
Effective adaptation to varying noise patterns.
Abstract
Quantum computing (QC) promises significant advantages on certain hard computational tasks over classical computers. However, current quantum hardware, also known as noisy intermediate-scale quantum computers (NISQ), are still unable to carry out computations faithfully mainly because of the lack of quantum error correction (QEC) capability. A significant amount of theoretical studies have provided various types of QEC codes; one of the notable topological codes is the surface code, and its features, such as the requirement of only nearest-neighboring two-qubit control gates and a large error threshold, make it a leading candidate for scalable quantum computation. Recent developments of machine learning (ML)-based techniques especially the reinforcement learning (RL) methods have been applied to the decoding problem and have already made certain progress. Nevertheless, the device noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices
MethodsQ-Learning
