Decentralized multi-agent reinforcement learning algorithm using a   cluster-synchronized laser network

Shun Kotoku; Takatomo Mihana; Andr\'e R\"ohm; and Ryoichi Horisaki

arXiv:2407.09124·cs.LG·July 15, 2024

Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network

Shun Kotoku, Takatomo Mihana, Andr\'e R\"ohm, and Ryoichi Horisaki

PDF

Open Access

TL;DR

This paper introduces a photonic-based decentralized multi-agent reinforcement learning algorithm leveraging laser network synchronization to solve the competitive multi-armed bandit problem efficiently.

Contribution

It presents a novel approach using chaotic laser oscillations and cluster synchronization for decentralized decision-making in MARL without explicit information sharing.

Findings

01

Chaotic laser oscillations enable effective exploration and exploitation.

02

Cluster synchronization facilitates cooperative decision-making.

03

Decentralized control is achieved through physical laser network dynamics.

Abstract

Multi-agent reinforcement learning (MARL) studies crucial principles that are applicable to a variety of fields, including wireless networking and autonomous driving. We propose a photonic-based decision-making algorithm to address one of the most fundamental problems in MARL, called the competitive multi-armed bandit (CMAB) problem. Our numerical simulations demonstrate that chaotic oscillations and cluster synchronization of optically coupled lasers, along with our proposed decentralized coupling adjustment, efficiently balance exploration and exploitation while facilitating cooperative decision-making without explicitly sharing information among agents. Our study demonstrates how decentralized reinforcement learning can be achieved by exploiting complex physical processes controlled by simple algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemiconductor Lasers and Optical Devices · Extremum Seeking Control Systems · Neural Networks and Reservoir Computing