TL;DR
This paper introduces a novel approach using variational quantum circuits to implement deep reinforcement learning algorithms, aiming for feasible deployment on near-term quantum devices and reducing model complexity.
Contribution
It presents the first proof-of-principle demonstration of variational quantum circuits approximating deep Q-values in reinforcement learning, integrating experience replay and target networks.
Findings
Quantum circuits effectively approximate deep Q-values.
Reduced parameter count compared to classical neural networks.
Feasible deployment on NISQ devices.
Abstract
The state-of-the-art machine learning approaches are based on classical von Neumann computing architectures and have been widely used in many industrial and academic domains. With the recent development of quantum computing, researchers and tech-giants have attempted new quantum circuits for machine learning tasks. However, the existing quantum computing platforms are hard to simulate classical deep learning models or problems because of the intractability of deep quantum circuits. Thus, it is necessary to design feasible quantum algorithms for quantum machine learning for noisy intermediate scale quantum (NISQ) devices. This work explores variational quantum circuits for deep reinforcement learning. Specifically, we reshape classical deep reinforcement learning algorithms like experience replay and target network into a representation of variational quantum circuits. Moreover, we use a…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 1
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 1
Figure 4
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsExperience Replay
