Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation
Guohui Ding, Joewie J. Koh, Kelly Merckaert, Bram Vanderborght, Marco, M. Nicotra, Christoffer Heckman, Alessandro Roncone, Lijun Chen

TL;DR
This paper introduces two distributed reinforcement learning methods for cooperative multi-robot object manipulation, demonstrating their effectiveness in simulation and potential scalability to larger multi-agent systems.
Contribution
It proposes two novel distributed multi-agent RL algorithms, DA-RL and GT-RL, tailored for cooperative multi-robot tasks, with validation in simulation.
Findings
Both methods successfully enable cooperative manipulation in simulation.
The approaches are scalable and applicable to larger multi-agent systems.
The methods outperform baseline approaches in cooperative tasks.
Abstract
We consider solving a cooperative multi-robot object manipulation task using reinforcement learning (RL). We propose two distributed multi-agent RL approaches: distributed approximate RL (DA-RL), where each agent applies Q-learning with individual reward functions; and game-theoretic RL (GT-RL), where the agents update their Q-values based on the Nash equilibrium of a bimatrix Q-value game. We validate the proposed approaches in the setting of cooperative object manipulation with two simulated robot arms. Although we focus on a small system of two agents in this paper, both DA-RL and GT-RL apply to general multi-agent systems, and are expected to scale well to large systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
MethodsQ-Learning
