Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large   Language Models

Jingyang Zhang; Jingwei Sun; Eric Yeats; Yang Ouyang; Martin Kuo,; Jianyi Zhang; Hao Frank Yang; Hai Li

arXiv:2404.02936·cs.CL·February 13, 2025·3 cites

Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

Jingyang Zhang, Jingwei Sun, Eric Yeats, Yang Ouyang, Martin Kuo,, Jianyi Zhang, Hao Frank Yang, Hai Li

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces Min-K%++, a theoretically grounded method for detecting pre-training data in large language models, significantly outperforming previous heuristics and state-of-the-art techniques across multiple benchmarks.

Contribution

The paper proposes a novel, theoretically motivated approach for pre-training data detection based on local maxima identification in LLMs, advancing beyond heuristic-based methods.

Findings

01

Achieves new state-of-the-art detection performance on WikiMIA benchmark.

02

Consistently outperforms reference-free methods on MIMIR benchmark.

03

Performs comparably to reference-based methods requiring extra models.

Abstract

The problem of pre-training data detection for large language models (LLMs) has received growing attention due to its implications in critical issues like copyright violation and test data contamination. Despite improved performance, existing methods (including the state-of-the-art, Min-K%) are mostly developed upon simple heuristics and lack solid, reasonable foundations. In this work, we propose a novel and theoretically motivated methodology for pre-training data detection, named Min-K%++. Specifically, we present a key insight that training samples tend to be local maxima of the modeled distribution along each input dimension through maximum likelihood training, which in turn allow us to insightfully translate the problem into identification of local maxima. Then, we design our method accordingly that works under the discrete distribution modeled by LLMs, whose core idea is to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zjysteven/mink-plus-plus
pytorch

Datasets

zjysteven/WikiMIA_paraphrased_perturbed
dataset· 228 dl
228 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques