On the interpretability of neural network decoders
Lukas B\"odeker, Luc J. B. Kusters, Markus M\"uller

TL;DR
This paper introduces an interpretability toolbox using Shapley values to understand neural network decoders in quantum error correction, revealing how they learn to make fault-tolerant decisions and aiding improvements.
Contribution
It applies interpretability methods to neural decoders in quantum error correction, providing insights into their decision processes and guiding architecture enhancements.
Findings
Decoding decisions can be interpreted to understand learned structures.
Identifies flaws in syndrome processing affecting performance.
Supports improvements in neural decoder design.
Abstract
Neural-network (NN) based decoders are becoming increasingly popular in the field of quantum error correction (QEC), including for decoding of state-of-the-art quantum computation experiments. In this work, we make use of established interpretability methods from the field of machine learning, to introduce a toolbox to achieve an understanding of the underlying decoding logic of NN decoders, which have been trained but otherwise typically operate as black-box models. To illustrate the capabilities of the employed interpretability method, based on the Shapley value approximation, we provide an examplary case study of a NN decoder that is trained for flag-qubit based fault-tolerant (FT) QEC with the Steane code. We show how particular decoding decisions of the NN can be interpreted, and reveal how the NN learns to capture fundamental structures in the information gained from syndrome and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
