Policy Gradient Approach to Compilation of Variational Quantum Circuits
David A. Herrera-Mart\'i

TL;DR
This paper introduces a policy gradient reinforcement learning method for approximating quantum circuit compilations, offering advantages over gradient-free and traditional methods, especially in noisy environments.
Contribution
It presents a novel policy gradient approach for variational quantum circuit compilation that avoids the need for additional registers and long-range interactions.
Findings
More competitive than gradient-free methods with similar resources
Effective in both noiseless and noisy circuit scenarios
Does not require a separate register for fidelity estimation
Abstract
We propose a method for finding approximate compilations of quantum unitary transformations, based on techniques from policy gradient reinforcement learning. The choice of a stochastic policy allows us to rephrase the optimization problem in terms of probability distributions, rather than variational gates. In this framework, the optimal configuration is found by optimizing over distribution parameters, rather than over free angles. We show numerically that this approach can be more competitive than gradient-free methods, for a comparable amount of resources, both for noiseless and noisy circuits. Another interesting feature of this approach to variational compilation is that it does not need a separate register and long-range interactions to estimate the end-point fidelity, which is an improvement over methods which rely on the Hilbert-Schmidt test. We expect these techniques to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Quantum Information and Cryptography
