Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models
Jingyang Zhang, Jingwei Sun, Eric Yeats, Yang Ouyang, Martin Kuo,, Jianyi Zhang, Hao Frank Yang, Hai Li

TL;DR
This paper introduces Min-K%++, a theoretically grounded method for detecting pre-training data in large language models, significantly outperforming previous heuristics and state-of-the-art techniques across multiple benchmarks.
Contribution
The paper proposes a novel, theoretically motivated approach for pre-training data detection based on local maxima identification in LLMs, advancing beyond heuristic-based methods.
Findings
Achieves new state-of-the-art detection performance on WikiMIA benchmark.
Consistently outperforms reference-free methods on MIMIR benchmark.
Performs comparably to reference-based methods requiring extra models.
Abstract
The problem of pre-training data detection for large language models (LLMs) has received growing attention due to its implications in critical issues like copyright violation and test data contamination. Despite improved performance, existing methods (including the state-of-the-art, Min-K%) are mostly developed upon simple heuristics and lack solid, reasonable foundations. In this work, we propose a novel and theoretically motivated methodology for pre-training data detection, named Min-K%++. Specifically, we present a key insight that training samples tend to be local maxima of the modeled distribution along each input dimension through maximum likelihood training, which in turn allow us to insightfully translate the problem into identification of local maxima. Then, we design our method accordingly that works under the discrete distribution modeled by LLMs, whose core idea is to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
