Did the Neurons Read your Book? Document-level Membership Inference for   Large Language Models

Matthieu Meeus; Shubham Jain; Marek Rei; Yves-Alexandre de Montjoye

arXiv:2310.15007·cs.CL·July 17, 2024·1 cites

Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models

Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye

PDF

Open Access 2 Repos 1 Datasets

TL;DR

This paper introduces a method to determine if large language models have seen specific documents during training, revealing privacy risks and enhancing transparency of these models.

Contribution

It presents a novel black-box approach for document-level membership inference on LLMs, outperforming existing sentence-level attacks and evaluating mitigation strategies.

Findings

01

High accuracy in inferring document membership (AUC 0.856 for books)

02

Outperforms sentence-level inference attacks

03

Model sensitivity remains high even with smaller models and partial documents

Abstract

With large language models (LLMs) poised to become embedded in our daily lives, questions are starting to be raised about the data they learned from. These questions range from potential bias or misinformation LLMs could retain from their training data to questions of copyright and fair use of human-generated text. However, while these questions emerge, developers of the recent state-of-the-art LLMs become increasingly reluctant to disclose details on their training corpus. We here introduce the task of document-level membership inference for real-world LLMs, i.e. inferring whether the LLM has seen a given document during training or not. First, we propose a procedure for the development and evaluation of document-level membership inference for LLMs by leveraging commonly used data sources for training and the model release date. We then propose a practical, black-box method to predict…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

imperial-cpg/project-gutenberg-extended
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning