DE-COP: Detecting Copyrighted Content in Language Models Training Data

Andr\'e V. Duarte; Xuandong Zhao; Arlindo L. Oliveira; Lei Li

arXiv:2402.09910·cs.CL·June 26, 2024·3 cites

DE-COP: Detecting Copyrighted Content in Language Models Training Data

Andr\'e V. Duarte, Xuandong Zhao, Arlindo L. Oliveira, Lei Li

PDF

Open Access 2 Repos

TL;DR

DE-COP is a novel method that detects copyrighted content in language model training data by probing models with multiple-choice questions including verbatim and paraphrased excerpts, outperforming previous approaches.

Contribution

The paper introduces DE-COP, a new probing technique and a benchmark dataset for identifying copyrighted material in language models, achieving significant improvements over prior methods.

Findings

01

DE-COP surpasses previous methods by 9.6% in AUC on models with logits.

02

DE-COP achieves 72% accuracy in detecting suspect books on black-box models.

03

The benchmark BookTection includes excerpts from 165 books for evaluation.

Abstract

How can we detect if copyrighted content was used in the training process of a language model, considering that the training data is typically undisclosed? We are motivated by the premise that a language model is likely to identify verbatim excerpts from its training text. We propose DE-COP, a method to determine whether a piece of copyrighted content was included in training. DE-COP's core approach is to probe an LLM with multiple-choice questions, whose options include both verbatim text and their paraphrases. We construct BookTection, a benchmark with excerpts from 165 books published prior and subsequent to a model's training cutoff, along with their paraphrases. Our experiments show that DE-COP surpasses the prior best method by 9.6% in detection performance (AUC) on models with logits available. Moreover, DE-COP also achieves an average accuracy of 72% for detecting suspect books…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques