Decentralized Multi-Robot Formation Control Using Reinforcement Learning
Juraj Obradovic, Marko Krizmancic, Stjepan Bogdan

TL;DR
This paper introduces a decentralized reinforcement learning approach using Double Deep Q-Networks for multi-robot formation control, enabling stable formation achievement and maintenance without complex models.
Contribution
It applies DDQN to multi-robot formation control, demonstrating effective decentralized coordination with simple discrete actions in real and simulated environments.
Findings
Successful formation achievement in simulation and real-world tests
Stable formation maintenance without complex control laws
Effective use of DDQN for decentralized multi-robot coordination
Abstract
This paper presents a decentralized leader-follower multi-robot formation control based on a reinforcement learning (RL) algorithm applied to a swarm of small educational Sphero robots. Since the basic Q-learning method is known to require large memory resources for Q-tables, this work implements the Double Deep Q-Network (DDQN) algorithm, which has achieved excellent results in many robotic problems. To enhance the system behavior, we trained two different DDQN models, one for reaching the formation and the other for maintaining it. The models use a discrete set of robot motions (actions) to adapt the continuous nonlinear system to the discrete nature of RL. The presented approach has been tested in simulation and real experiments which show that the multi-robot system can achieve and maintain a stable formation without the need for complex mathematical models and nonlinear control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Neural Networks and Reservoir Computing · Adaptive Dynamic Programming Control
MethodsQ-Learning
