Reinforcement learning for optimal error correction of toric codes

Laia Domingo Colomer; Michalis Skotiniotis; and Ramon Mu\~noz-Tapia

arXiv:1911.02308·quant-ph·March 9, 2020

Reinforcement learning for optimal error correction of toric codes

Laia Domingo Colomer, Michalis Skotiniotis, and Ramon Mu\~noz-Tapia

PDF

TL;DR

This paper introduces a deep reinforcement learning approach to optimize error correction in toric codes, achieving near-theoretical thresholds and discovering policies akin to traditional methods without prior bias.

Contribution

The study demonstrates the effectiveness of deep reinforcement learning in designing high-threshold decoders for toric codes, matching near-optimal performance.

Findings

01

Achieves near 11% threshold for uncorrelated noise

02

Agent's policy resembles minimum weight perfect matching

03

Deep convolutional networks enable effective decoding

Abstract

We apply deep reinforcement learning techniques to design high threshold decoders for the toric code under uncorrelated noise. By rewarding the agent only if the decoding procedure preserves the logical states of the toric code, and using deep convolutional networks for the training phase of the agent, we observe near-optimal performance for uncorrelated noise around the theoretically optimal threshold of 11%. We observe that, by and large, the agent implements a policy similar to that of minimum weight perfect matchings even though no bias towards any policy is given a priori.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.