Privacy Preserving Reinforcement Learning for Population Processes
Samuel Yang-Zhao, Kee Siong Ng

TL;DR
This paper introduces a method for applying differential privacy to reinforcement learning in population processes, ensuring privacy while maintaining utility, demonstrated through epidemic control simulations.
Contribution
It provides a meta algorithm to make any RL algorithm differentially private in population settings, with theoretical guarantees on utility loss.
Findings
Privacy-utility trade-offs are feasible in large populations.
Value-function approximation error decreases with population size and privacy budget.
Experimental validation on epidemic control shows practical effectiveness.
Abstract
We consider the problem of privacy protection in Reinforcement Learning (RL) algorithms that operate over population processes, a practical but understudied setting that includes, for example, the control of epidemics in large populations of dynamically interacting individuals. In this setting, the RL algorithm interacts with the population over time steps by receiving population-level statistics as state and performing actions which can affect the entire population at each time step. An individual's data can be collected across multiple interactions and their privacy must be protected at all times. We clarify the Bayesian semantics of Differential Privacy (DP) in the presence of correlated data in population processes through a Pufferfish Privacy analysis. We then give a meta algorithm that can take any RL algorithm as input and make it differentially private. This is achieved by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Innovation Diffusion and Forecasting
