StabilizerBench: A Benchmark for AI-Assisted Quantum Error Correction Circuit Synthesis

Andres Paz; Christian Tarta; Cordelia Yuqiao Li; Mayee Sun; Sarju Patel; Sylvie Lausier

arXiv:2604.21287·quant-ph·April 24, 2026

StabilizerBench: A Benchmark for AI-Assisted Quantum Error Correction Circuit Synthesis

Andres Paz, Christian Tarta, Cordelia Yuqiao Li, Mayee Sun, Sarju Patel, Sylvie Lausier

PDF

TL;DR

StabilizerBench is a comprehensive benchmark suite designed to evaluate AI agents on the synthesis, optimization, and fault-tolerant construction of quantum error correction circuits, facilitating progress measurement in quantum programming automation.

Contribution

It introduces a novel, scalable benchmark with scoring metrics for AI-generated stabilizer circuits across multiple tasks and code sizes, filling a gap in quantum error correction research.

Findings

01

Benchmark discriminates AI model performance effectively.

02

AI agents show significant room for improvement.

03

The suite enables scalable verification via Gottesman Knill theorem.

Abstract

As quantum hardware scales toward fault tolerant operation, the demand for correct quantum error correction (QEC) circuits far outpaces manual design capacity. AI agents offer a promising path to automating this synthesis, yet no benchmark exists to measure their progress on the specialized task of generating QEC circuits. We introduce StabilizerBench, a benchmark suite of 192 stabilizer codes spanning 12 families, 4-196 qubits, and distances 2-21, organized into three tasks of increasing difficulty: state preparation circuit generation, circuit optimization under semantic constraints, and fault tolerant circuit synthesis. Although motivated by QEC, stabilizer circuits exercise core competencies required for general quantum programming, including gate decomposition, qubit routing, and semantic preserving transformations, while admitting efficient verification via the Gottesman Knill…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.