Many-Shot Regurgitation (MSR) Prompting
Shashank Sonkar, Richard G. Baraniuk

TL;DR
This paper introduces Many-Shot Regurgitation (MSR) prompting, a black-box attack method to detect verbatim content reproduction in large language models by analyzing output patterns across different datasets.
Contribution
The paper presents MSR prompting as a novel framework for membership inference, revealing significant verbatim reproduction differences based on training data exposure in LLMs.
Findings
Higher verbatim match frequency in models' training data
Significant statistical differences between pre- and post-training datasets
Evidence of increased content reproduction when prompted with familiar data
Abstract
We introduce Many-Shot Regurgitation (MSR) prompting, a new black-box membership inference attack framework for examining verbatim content reproduction in large language models (LLMs). MSR prompting involves dividing the input text into multiple segments and creating a single prompt that includes a series of faux conversation rounds between a user and a language model to elicit verbatim regurgitation. We apply MSR prompting to diverse text sources, including Wikipedia articles and open educational resources (OER) textbooks, which provide high-quality, factual content and are continuously updated over time. For each source, we curate two dataset types: one that LLMs were likely exposed to during training () and another consisting of documents published after the models' training cutoff dates (). To quantify the occurrence of verbatim matches, we employ the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Discriminative Fine-Tuning · Cosine Annealing · Dropout · Linear Warmup With Cosine Annealing · Residual Connection · Byte Pair Encoding · Adam · Softmax
