Reinforced Disentanglers on Random Unitary Circuits
Ning Bao, Keiichiro Furuya, Gun Suer

TL;DR
This paper employs reinforcement learning to identify minimal measurement-based disentanglers in random Clifford circuits, revealing that fewer measurements are needed for disentanglement than previously thought and characterizing their optimal patterns.
Contribution
It introduces a reinforcement learning approach to find efficient disentanglers in quantum circuits, surpassing prior measurement counts and enabling pattern characterization.
Findings
Fewer measurements are needed for disentanglement than earlier estimates.
Reinforcement learning effectively finds optimal measurement patterns.
Disentanglers exhibit specific, characterizable measurement configurations.
Abstract
We search for efficient disentanglers on random Clifford circuits of two-qubit gates arranged in a brick-wall pattern, using the proximal policy optimization (PPO) algorithm \cite{schulman2017proximalpolicyoptimizationalgorithms}. Disentanglers are defined as a set of projective measurements inserted between consecutive entangling layers. An efficient disentangler is a set of projective measurements that minimize the averaged von Neumann entropy of the final state with the least number of total projections possible. The problem is naturally amenable to reinforcement learning techniques by taking the binary matrix representing the projective measurements along the circuit as our state, and actions as bit flipping operations on this binary matrix that add or delete measurements at specified locations. We give rewards to our agent dependent on the averaged von Neumann entropy of the final…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Chaos-based Image/Signal Encryption · Handwritten Text Recognition Techniques
MethodsSparse Evolutionary Training
