AlphaCNOT: Learning CNOT Minimization with Model-Based Planning
Jacopo Cossio, Daniele Lizzio Bosco, Riccardo Romanello, Giuseppe Serra, Carla Piazza

TL;DR
AlphaCNOT introduces a model-based reinforcement learning framework using Monte Carlo Tree Search for efficient CNOT gate minimization in quantum circuits, outperforming existing methods.
Contribution
It is the first to combine RL with lookahead search for CNOT minimization, achieving significant reductions over prior heuristic and RL-based approaches.
Findings
Up to 32% reduction in CNOT gates compared to PMH baseline.
Consistent gate count reduction across various topologies with up to 8 qubits.
Model-based RL with search outperforms previous RL solutions.
Abstract
Quantum circuit optimization is a central task in Quantum Computing, as current Noisy Intermediate Scale Quantum devices suffer from error propagation that often scales with the number of operations. Among quantum operations, the CNOT gate is of fundamental importance, being the only 2-qubit gate in the universal Clifford+T set. The problem of CNOT gates minimization has been addressed by heuristic algorithms such as the well-known Patel-Markov-Hayes (PMH) for linear reversible synthesis (i.e., CNOT minimization with no topological constraints), and more recently by Reinforcement Learning (RL) based strategies in the more complex case of topology-aware synthesis, where each CNOT can act on a subset of all qubits pairs. In this work we introduce AlphaCNOT, a RL framework based on Monte Carlo Tree Search (MCTS) that address effectively the CNOT minimization problem by modeling it as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
