VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability
Rodrigo Coelho, Andr\'e Sequeira, Lu\'is Paulo Santos

TL;DR
This paper investigates the use of Variational Quantum Circuits in Deep Q-Learning, focusing on how data re-uploading influences performance and trainability in classic control tasks, revealing promising trainability properties.
Contribution
It provides an empirical analysis of VQC-based Deep Q-Learning, highlighting the effects of data re-uploading and the non-vanishing gradients with increasing qubits.
Findings
Gradients remain substantial throughout training.
Increasing qubits does not cause exponential gradient vanishing.
VQCs may be suitable as function approximators in RL.
Abstract
Reinforcement Learning (RL) consists of designing agents that make intelligent decisions without human supervision. When used alongside function approximators such as Neural Networks (NNs), RL is capable of solving extremely complex problems. Deep Q-Learning, a RL algorithm that uses Deep NNs, achieved super-human performance in some specific tasks. Nonetheless, it is also possible to use Variational Quantum Circuits (VQCs) as function approximators in RL algorithms. This work empirically studies the performance and trainability of such VQC-based Deep Q-Learning models in classic control benchmark environments. More specifically, we research how data re-uploading affects both these metrics. We show that the magnitude and the variance of the gradients of these models remain substantial throughout training due to the moving targets of Deep Q-Learning. Moreover, we empirically show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvancements in Semiconductor Devices and Circuit Design · Quantum Computing Algorithms and Architecture · Quantum and electron transport phenomena
MethodsQ-Learning
