Decentralized Multi-Robot Formation Control Using Reinforcement Learning

Juraj Obradovic; Marko Krizmancic; Stjepan Bogdan

arXiv:2306.14489·cs.RO·July 17, 2023

Decentralized Multi-Robot Formation Control Using Reinforcement Learning

Juraj Obradovic, Marko Krizmancic, Stjepan Bogdan

PDF

Open Access

TL;DR

This paper introduces a decentralized reinforcement learning approach using Double Deep Q-Networks for multi-robot formation control, enabling stable formation achievement and maintenance without complex models.

Contribution

It applies DDQN to multi-robot formation control, demonstrating effective decentralized coordination with simple discrete actions in real and simulated environments.

Findings

01

Successful formation achievement in simulation and real-world tests

02

Stable formation maintenance without complex control laws

03

Effective use of DDQN for decentralized multi-robot coordination

Abstract

This paper presents a decentralized leader-follower multi-robot formation control based on a reinforcement learning (RL) algorithm applied to a swarm of small educational Sphero robots. Since the basic Q-learning method is known to require large memory resources for Q-tables, this work implements the Double Deep Q-Network (DDQN) algorithm, which has achieved excellent results in many robotic problems. To enhance the system behavior, we trained two different DDQN models, one for reaching the formation and the other for maintaining it. The models use a discrete set of robot motions (actions) to adapt the continuous nonlinear system to the discrete nature of RL. The presented approach has been tested in simulation and real experiments which show that the multi-robot system can achieve and maintain a stable formation without the need for complex mathematical models and nonlinear control…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Neural Networks and Reservoir Computing · Adaptive Dynamic Programming Control

MethodsQ-Learning