Federated Ensemble Model-based Reinforcement Learning in Edge Computing

Jin Wang; Jia Hu; Jed Mills; Geyong Min; and Ming Xia

arXiv:2109.05549·cs.LG·April 4, 2023·1 cites

Federated Ensemble Model-based Reinforcement Learning in Edge Computing

Jin Wang, Jia Hu, Jed Mills, Geyong Min, and Ming Xia

PDF

Open Access

TL;DR

This paper introduces a novel federated reinforcement learning algorithm that combines model-based methods and ensemble knowledge distillation to improve sample efficiency and provide theoretical guarantees in edge computing environments.

Contribution

It proposes the first integration of model-based RL and ensemble knowledge distillation into federated learning for reinforcement learning tasks, with proven monotonic improvement.

Findings

01

Achieves higher sample efficiency than classic model-free FRL algorithms.

02

Demonstrates robustness to heterogeneous client data.

03

Validates theoretical analysis through extensive experiments.

Abstract

Federated learning (FL) is a privacy-preserving distributed machine learning paradigm that enables collaborative training among geographically distributed and heterogeneous devices without gathering their data. Extending FL beyond the supervised learning models, federated reinforcement learning (FRL) was proposed to handle sequential decision-making problems in edge computing systems. However, the existing FRL algorithms directly combine model-free RL with FL, thus often leading to high sample complexity and lacking theoretical guarantees. To address the challenges, we propose a novel FRL algorithm that effectively incorporates model-based RL and ensemble knowledge distillation into FL for the first time. Specifically, we utilise FL and knowledge distillation to create an ensemble of dynamics models for clients, and then train the policy by solely using the ensemble model without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Transportation and Mobility Innovations

MethodsKnowledge Distillation