Quantum agents in the Gym: a variational quantum algorithm for deep Q-learning
Andrea Skolik, Sofiene Jerbi, Vedran Dunjko

TL;DR
This paper introduces a variational quantum algorithm for deep Q-learning, exploring how quantum agents can be designed and optimized for reinforcement learning tasks, and comparing their performance to classical algorithms.
Contribution
The authors develop a quantum deep Q-learning method using parametrized quantum circuits and analyze the impact of architectural choices and observables on performance.
Findings
Quantum Q-learning performance depends on observables and encoding strategies.
Architectural choices and hyperparameters are more influential than the number of parameters.
Quantum agents can be competitive with classical algorithms in certain environments.
Abstract
Quantum machine learning (QML) has been identified as one of the key fields that could reap advantages from near-term quantum devices, next to optimization and quantum chemistry. Research in this area has focused primarily on variational quantum algorithms (VQAs), and several proposals to enhance supervised, unsupervised and reinforcement learning (RL) algorithms with VQAs have been put forward. Out of the three, RL is the least studied and it is still an open question whether VQAs can be competitive with state-of-the-art classical algorithms based on neural networks (NNs) even on simple benchmark tasks. In this work, we introduce a training method for parametrized quantum circuits (PQCs) that can be used to solve RL tasks for discrete and continuous state spaces based on the deep Q-learning algorithm. We investigate which architectural choices for quantum Q-learning agents are most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum Information and Cryptography · Neural Networks and Reservoir Computing
