LLM Dataset Inference: Did you train on my dataset?

Pratyush Maini; Hengrui Jia; Nicolas Papernot; Adam Dziedzic

arXiv:2406.06443·cs.LG·June 11, 2024·2 cites

LLM Dataset Inference: Did you train on my dataset?

Pratyush Maini, Hengrui Jia, Nicolas Papernot, Adam Dziedzic

PDF

Open Access 1 Repo

TL;DR

This paper critically examines membership inference attacks on large language models, revealing their limitations due to distribution shifts, and introduces a new dataset inference method that accurately identifies training datasets without false positives.

Contribution

The paper demonstrates the confounding effect of distribution shifts on MIAs and proposes a novel dataset inference technique that reliably detects training data used for LLMs.

Findings

01

MIAs are confounded by distribution shifts, reducing their effectiveness.

02

Most MIAs perform no better than random guessing within the same distribution.

03

The proposed dataset inference method accurately identifies training datasets with significant statistical confidence.

Abstract

The proliferation of large language models (LLMs) in the real world has come with a rise in copyright cases against companies for training their models on unlicensed data from the internet. Recent works have presented methods to identify if individual text sequences were members of the model's training data, known as membership inference attacks (MIAs). We demonstrate that the apparent success of these MIAs is confounded by selecting non-members (text sequences not used for training) belonging to a different distribution from the members (e.g., temporally shifted recent Wikipedia articles compared with ones used to train the model). This distribution shift makes membership inference appear successful. However, most MIA methods perform no better than random guessing when discriminating between members and non-members from the same distribution (e.g., in this case, the same period of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pratyushmaini/llm_dataset_inference
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Machine Learning and Data Classification · AI in cancer detection