Federated Ensemble-Directed Offline Reinforcement Learning

Desik Rengarajan; Nitin Ragothaman; Dileep Kalathil; Srinivas; Shakkottai

arXiv:2305.03097·cs.LG·October 7, 2024·2 cites

Federated Ensemble-Directed Offline Reinforcement Learning

Desik Rengarajan, Nitin Ragothaman, Dileep Kalathil, Srinivas, Shakkottai

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces FEDORA, a federated ensemble-directed offline RL algorithm that effectively combines data from distributed agents, outperforming traditional methods in complex control tasks and real-world robotics applications.

Contribution

The paper proposes FEDORA, a novel ensemble-based federated offline RL algorithm that improves policy quality by distilling collective knowledge from distributed datasets.

Findings

01

FEDORA outperforms standard offline RL and federated approaches in continuous control tasks.

02

FEDORA achieves superior results on real-world datasets and a mobile robot.

03

The code and experiments are publicly available for reproducibility.

Abstract

We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies. Na\"{i}vely combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

desikrengarajan/fedora
pytorchOfficial

Videos

Federated Ensemble-Directed Offline Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations