Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan, Nitin Ragothaman, Dileep Kalathil, Srinivas, Shakkottai

TL;DR
This paper introduces FEDORA, a federated ensemble-directed offline RL algorithm that effectively combines data from distributed agents, outperforming traditional methods in complex control tasks and real-world robotics applications.
Contribution
The paper proposes FEDORA, a novel ensemble-based federated offline RL algorithm that improves policy quality by distilling collective knowledge from distributed datasets.
Findings
FEDORA outperforms standard offline RL and federated approaches in continuous control tasks.
FEDORA achieves superior results on real-world datasets and a mobile robot.
The code and experiments are publicly available for reproducibility.
Abstract
We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies. Na\"{i}vely combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations
