Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility   Cooperation

Soohyun Park; Jae Pyoung Kim; Chanyoung Park; Soyi Jung; Joongheon Kim

arXiv:2308.01519·cs.MA·August 4, 2023·2 cites

Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation

Soohyun Park, Jae Pyoung Kim, Chanyoung Park, Soyi Jung, Joongheon Kim

PDF

Open Access

TL;DR

This paper introduces a quantum multi-agent reinforcement learning algorithm designed for autonomous mobility cooperation, improving scalability, parameter efficiency, and convergence speed in multi-agent systems within the NISQ era.

Contribution

The paper proposes a novel quantum MARL algorithm with an actor-critic network and a projection value measure for enhanced scalability, efficiency, and convergence in multi-agent autonomous systems.

Findings

01

QMARL outperforms classical algorithms in reward and convergence speed.

02

PVM reduces action dimension logarithmically, improving scalability.

03

QMARL demonstrates superior parameter utilization and scalability.

Abstract

For Industry 4.0 Revolution, cooperative autonomous mobility systems are widely used based on multi-agent reinforcement learning (MARL). However, the MARL-based algorithms suffer from huge parameter utilization and convergence difficulties with many agents. To tackle these problems, a quantum MARL (QMARL) algorithm based on the concept of actor-critic network is proposed, which is beneficial in terms of scalability, to deal with the limitations in the noisy intermediate-scale quantum (NISQ) era. Additionally, our QMARL is also beneficial in terms of efficient parameter utilization and fast convergence due to quantum supremacy. Note that the reward in our QMARL is defined as task precision over computation time in multiple agents, thus, multi-agent cooperation can be realized. For further improvement, an additional technique for scalability is proposed, which is called projection value…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · stochastic dynamics and bifurcation · Neural dynamics and brain function