ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning

Linkang Du; Min Chen; Mingyang Sun; Shouling Ji; Peng Cheng; Jiming; Chen; Zhikun Zhang

arXiv:2309.03081·cs.CR·September 7, 2023

ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning

Linkang Du, Min Chen, Mingyang Sun, Shouling Ji, Peng Cheng, Jiming, Chen, Zhikun Zhang

PDF

1 Repo

TL;DR

This paper introduces ORL-AUDITOR, a novel dataset auditing method for offline deep reinforcement learning that uses cumulative rewards as unique identifiers, achieving over 95% accuracy in identifying datasets used for training.

Contribution

The paper presents the first trajectory-level dataset auditing mechanism for offline RL, addressing limitations of existing methods and demonstrating high accuracy and practicality.

Findings

01

Achieves over 95% auditing accuracy

02

False positive rate below 2.88%

03

Effective on datasets from Google and DeepMind

Abstract

Data is a critical asset in AI, as high-quality datasets can significantly improve the performance of machine learning models. In safety-critical domains such as autonomous vehicles, offline deep reinforcement learning (offline DRL) is frequently used to train models on pre-collected datasets, as opposed to training these models by interacting with the real-world environment as the online DRL. To support the development of these models, many institutions make datasets publicly available with opensource licenses, but these datasets are at risk of potential misuse or infringement. Injecting watermarks to the dataset may protect the intellectual property of the data, but it cannot handle datasets that have already been published and is infeasible to be altered afterward. Other existing solutions, such as dataset inference and membership inference, do not work well in the offline DRL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

link-zju/orl-auditor
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.