Uncovering Instabilities in Variational-Quantum Deep Q-Networks

Maja Franz (1); Lucas Wolf (1); Maniraman Periyasamy (2); Christian; Ufrecht (2); Daniel D. Scherer (2); Axel Plinge (2); Christopher Mutschler; (2); Wolfgang Mauerer (1,3) ((1) Technical University of Applied Sciences,; Regensburg; Germany; (2) Fraunhofer-IIS; Fraunhofer Institute for Integrated; Circuits IIS; Division Positioning; Networks; Nuremberg; Germany; (3); Siemens AG; Corporate Research; Munich; Germany)

arXiv:2202.05195·quant-ph·September 19, 2022·1 cites

Uncovering Instabilities in Variational-Quantum Deep Q-Networks

Maja Franz (1), Lucas Wolf (1), Maniraman Periyasamy (2), Christian, Ufrecht (2), Daniel D. Scherer (2), Axel Plinge (2), Christopher Mutschler, (2), Wolfgang Mauerer (1,3) ((1) Technical University of Applied Sciences,, Regensburg, Germany, (2) Fraunhofer-IIS

PDF

Open Access 1 Repo

TL;DR

This paper investigates the instabilities and reproducibility issues of variational quantum deep Q-networks (VQ-DQN) in reinforcement learning, compares simulated and real quantum hardware performance, and provides a robust implementation for future research.

Contribution

It systematically analyzes the stability of VQ-DQN algorithms, compares simulation with real quantum hardware, and offers a reproducible implementation for further studies.

Findings

01

VQ-DQN algorithms exhibit significant instabilities leading to policy divergence.

02

Experiments on actual quantum hardware reveal differences from simulated results.

03

No conclusive evidence that quantum approaches outperform classical methods in RL.

Abstract

Deep Reinforcement Learning (RL) has considerably advanced over the past decade. At the same time, state-of-the-art RL algorithms require a large computational budget in terms of training time to converge. Recent work has started to approach this problem through the lens of quantum computing, which promises theoretical speed-ups for several traditionally hard tasks. In this work, we examine a class of hybrid quantum-classical RL algorithms that we collectively refer to as variational quantum deep Q-networks (VQ-DQN). We show that VQ-DQN approaches are subject to instabilities that cause the learned policy to diverge, study the extent to which this afflicts reproduciblity of established results based on classical simulation, and perform systematic experiments to identify potential explanations for the observed instabilities. Additionally, and in contrast to most existing work on quantum…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lfd/quantum-rl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing