Towards Multi-Agent Reinforcement Learning using Quantum Boltzmann   Machines

Tobias M\"uller; Christoph Roch; Kyrill Schmid; Philipp Altmann

arXiv:2109.10900·cs.AI·November 23, 2021

Towards Multi-Agent Reinforcement Learning using Quantum Boltzmann Machines

Tobias M\"uller, Christoph Roch, Kyrill Schmid, Philipp Altmann

PDF

Open Access

TL;DR

This paper extends quantum Boltzmann machine-based multi-agent reinforcement learning to more complex grid domains, demonstrating improved stability and policy optimization, while analyzing the impact of parameter sharing and quantum sampling limitations.

Contribution

It introduces an extended MARL algorithm using QBMs with experience replay and multiple networks, enabling application to larger, more complex environments.

Findings

01

Learning becomes more stable in complex grid domains.

02

Agents can find optimal policies in higher complexity environments.

03

Quantum sampling shows promise but is limited by hardware constraints.

Abstract

Reinforcement learning has driven impressive advances in machine learning. Simultaneously, quantum-enhanced machine learning algorithms using quantum annealing underlie heavy developments. Recently, a multi-agent reinforcement learning (MARL) architecture combining both paradigms has been proposed. This novel algorithm, which utilizes Quantum Boltzmann Machines (QBMs) for Q-value approximation has outperformed regular deep reinforcement learning in terms of time-steps needed to converge. However, this algorithm was restricted to single-agent and small 2x2 multi-agent grid domains. In this work, we propose an extension to the original concept in order to solve more challenging problems. Similar to classic DQNs, we add an experience replay buffer and use different networks for approximating the target and policy values. The experimental results show that learning becomes more stable and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Advanced Memory and Neural Computing · Machine Learning and ELM

MethodsExperience Replay