Equivariant Reinforcement Learning for Clifford Quantum Circuit Synthesis
Richie Yeung, Aleks Kissinger, Rob Cornish

TL;DR
This paper introduces a size-agnostic, equivariant reinforcement learning approach for synthesizing Clifford quantum circuits, achieving near-optimal results efficiently across various qubit sizes.
Contribution
It presents a novel neural network architecture that is equivariant and size-agnostic, enabling scalable and efficient Clifford circuit synthesis with reinforcement learning.
Findings
Agent finds circuits within one two-qubit gate of optimality in milliseconds.
Achieves 99.2% optimal circuits within seconds for six-qubit instances.
Outperforms existing Clifford synthesizers on large, unseen qubit instances.
Abstract
We consider the problem of synthesizing Clifford quantum circuits for devices with all-to-all qubit connectivity. We approach this task as a reinforcement learning problem in which an agent learns to discover a sequence of elementary Clifford gates that reduces a given symplectic matrix representation of a Clifford circuit to the identity. This formulation permits a simple learning curriculum based on random walks from the identity. We introduce a novel neural network architecture that is equivariant to qubit relabelings of the symplectic matrix representation, and which is size-agnostic, allowing a single learned policy to be applied across different qubit counts without circuit splicing or network reparameterization. On six-qubit Clifford circuits, the largest regime for which optimal references are available, our agent finds circuits within one two-qubit gate of optimality in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
