On Decentralizing Federated Reinforcement Learning in Multi-Robot   Scenarios

Jayprakash S. Nair; Divya D. Kulkarni; Ajitem Joshi; Sruthy Suresh

arXiv:2207.09372·cs.RO·September 9, 2022·1 cites

On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios

Jayprakash S. Nair, Divya D. Kulkarni, Ajitem Joshi, Sruthy Suresh

PDF

Open Access

TL;DR

This paper proposes a decentralized federated reinforcement learning approach for multi-robot systems, utilizing mobile agents to avoid central server failures and bandwidth bottlenecks, demonstrated through simulations with Q-learning and SARSA.

Contribution

It introduces a mobile agent-based paradigm for decentralizing federated learning in multi-robot scenarios, enhancing robustness and scalability.

Findings

01

Decentralized FL is viable with mobile agents in robotics.

02

Experiments with Q-learning and SARSA show effective model aggregation.

03

Decentralized FL can be integrated with various learning algorithms.

Abstract

Federated Learning (FL) allows for collaboratively aggregating learned information across several computing devices and sharing the same amongst them, thereby tackling issues of privacy and the need of huge bandwidth. FL techniques generally use a central server or cloud for aggregating the models received from the devices. Such centralized FL techniques suffer from inherent problems such as failure of the central node and bottlenecks in channel bandwidth. When FL is used in conjunction with connected robots serving as devices, a failure of the central controlling entity can lead to a chaotic situation. This paper describes a mobile agent based paradigm to decentralize FL in multi-robot scenarios. Using Webots, a popular free open-source robot simulator, and Tartarus, a mobile agent platform, we present a methodology to decentralize federated learning in a set of connected robots. With…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cooperative Communication and Network Coding · Access Control and Trust

MethodsSarsa · Q-Learning