Federated Reinforcement Learning with Constraint Heterogeneity

Hao Jin; Liangyu Zhang; Zhihua Zhang

arXiv:2405.03236·cs.LG·May 7, 2024

Federated Reinforcement Learning with Constraint Heterogeneity

Hao Jin, Liangyu Zhang, Zhihua Zhang

PDF

Open Access

TL;DR

This paper introduces federated primal-dual policy optimization methods for reinforcement learning with multiple constraints across distributed agents, ensuring collaborative policy learning while respecting local constraint signals.

Contribution

It proposes novel federated primal-dual algorithms based on policy gradient methods, with convergence guarantees and practical implementations using NPG and PPO.

Findings

01

FedNPG achieves global convergence with rate (1/\u221A T)

02

FedPPO effectively handles complex tasks with deep neural networks

03

The methods address constraint heterogeneity in federated RL scenarios

Abstract

We study a Federated Reinforcement Learning (FedRL) problem with constraint heterogeneity. In our setting, we aim to solve a reinforcement learning problem with multiple constraints while $N$ training agents are located in $N$ different environments with limited access to the constraint signals and they are expected to collaboratively learn a policy satisfying all constraint signals. Such learning problems are prevalent in scenarios of Large Language Model (LLM) fine-tuning and healthcare applications. To solve the problem, we propose federated primal-dual policy optimization methods based on traditional policy gradient methods. Specifically, we introduce $N$ local Lagrange functions for agents to perform local policy updates, and these agents are then scheduled to periodically communicate on their local policies. Taking natural policy gradient (NPG) and proximal policy optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTransportation and Mobility Innovations · Traffic control and management

MethodsFocus