Locally Private Distributed Reinforcement Learning
Hajime Ono, Tsubasa Takahashi

TL;DR
This paper introduces a novel distributed reinforcement learning algorithm that ensures local differential privacy for agents, enabling robust policy learning across private environments while protecting sensitive information.
Contribution
First to implement distributed reinforcement learning under local differential privacy, combining noisy gradient updates with a central aggregator for privacy-preserving policy training.
Findings
The method achieves effective policy learning under LDP constraints.
Empirical results show competitive performance with non-private RL methods.
Demonstrates feasibility of privacy-preserving RL in distributed settings.
Abstract
We study locally differentially private algorithms for reinforcement learning to obtain a robust policy that performs well across distributed private environments. Our algorithm protects the information of local agents' models from being exploited by adversarial reverse engineering. Since a local policy is strongly being affected by the individual environment, the output of the agent may release the private information unconsciously. In our proposed algorithm, local agents update the model in their environments and report noisy gradients designed to satisfy local differential privacy (LDP) that gives a rigorous local privacy guarantee. By utilizing a set of reported noisy gradients, a central aggregator updates its model and delivers it to different local agents. In our empirical evaluation, we demonstrate how our method performs well under LDP. To the best of our knowledge, this is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Auction Theory and Applications · Reinforcement Learning in Robotics
