Preserving Expert-Level Privacy in Offline Reinforcement Learning

Navodita Sharma; Vishnu Vinod; Abhradeep Thakurta; Alekh Agarwal; Borja Balle; Christoph Dann; Aravindan Raghuveer

arXiv:2411.13598 · cs.CR · November 25, 2025

Open Access

TL;DR

This paper introduces an offline reinforcement learning training method that protects the experts who generated the data through expert-level differential privacy guarantees, while maintaining strong empirical performance across complex environments.

Contribution

The paper proposes a novel consensus-based, expert-level differentially private training approach for offline RL that is compatible with any existing offline RL algorithm, backed by rigorous privacy guarantees and proof-of-concept experiments.

Findings

01. Achieves differential privacy guarantees in offline RL.

02. Maintains strong empirical performance on complex environments.

03. Outperforms baseline methods in privacy-preserving settings.

Abstract

The offline reinforcement learning (RL) problem aims to learn an optimal policy from historical data collected by one or more behavioural policies (experts) by interacting with an environment. However, the individual experts may be privacy-sensitive in that the learnt policy may retain information about their precise choices. In some domains like personalized retrieval, advertising and healthcare, the expert choices are considered sensitive data. To provably protect the privacy of such experts, we propose a novel consensus-based expert-level differentially private offline RL training approach compatible with any existing offline RL algorithm. We prove rigorous differential privacy guarantees, while maintaining strong empirical performance. Unlike existing work in differentially private RL, we supplement the theory with proof-of-concept experiments on classic RL environments featuring…
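
For orientation, the guarantee the abstract refers to can be written in the standard $(\varepsilon, \delta)$ form. This is a sketch of the textbook definition of differential privacy, not a formula quoted from the paper. The symbols $\mathcal{A}$ (the randomized training procedure), $D$ and $D'$ (offline datasets), and $S$ (a set of possible outputs) are introduced here for illustration, as is the assumption that "expert-level" means two datasets are neighbouring when they differ in all the data contributed by a single expert.

```latex
% Standard (epsilon, delta)-differential privacy, stated under the assumed
% expert-level notion of neighbouring datasets (D and D' differ in the data
% of exactly one expert).
\Pr\bigl[\mathcal{A}(D) \in S\bigr]
  \;\le\; e^{\varepsilon}\,\Pr\bigl[\mathcal{A}(D') \in S\bigr] + \delta
\qquad \text{for all measurable } S \text{ and all neighbouring } D, D'.
```

Under this reading, smaller values of $\varepsilon$ and $\delta$ mean the learnt policy reveals less about the choices of any single expert.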

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Ethics and Social Impacts of AI · Privacy, Security, and Data Protection