RECAP: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline
Andr\'e V. Duarte, Xuying li, Bin Zeng, Arlindo L. Oliveira, Lei Li, Zhuo Li

TL;DR
RECAP is an innovative pipeline that uses an iterative, feedback-driven approach with a secondary model to extract and verify memorized copyrighted content from large language models, improving extraction accuracy.
Contribution
The paper introduces RECAP, a novel agentic pipeline that enhances the extraction of memorized data from LLMs through iterative feedback and a jailbreaking module, advancing data verification methods.
Findings
Significant improvement in extraction accuracy with RECAP over single-iteration methods.
Achieved a 24% increase in ROUGE-L score on the EchoTrace benchmark.
Demonstrated effectiveness on GPT-4.1 with over 30 full books.
Abstract
If we cannot inspect the training data of a large language model (LLM), how can we ever know what it has seen? We believe the most compelling evidence arises when the model itself freely reproduces the target content. As such, we propose RECAP, an agentic pipeline designed to elicit and verify memorized training data from LLM outputs. At the heart of RECAP is a feedback-driven loop, where an initial extraction attempt is evaluated by a secondary language model, which compares the output against a reference passage and identifies discrepancies. These are then translated into minimal correction hints, which are fed back into the target model to guide subsequent generations. In addition, to address alignment-induced refusals, RECAP includes a jailbreaking module that detects and overcomes such barriers. We evaluate RECAP on EchoTrace, a new benchmark spanning over 30 full books, and the…
Peer Reviews
Decision·Submitted to ICLR 2026
- The dataset used is comprehensive enough to effectively demonstrate that the proposed method can successfully guide the LLM to reveal memorized content. - The experiments are thorough and examine the method from multiple angles, including interesting phenomena like how popular or “welcome” certain memorized content tends to be.
- I’m really unsure about the writing style of this work. The method described in the main text feels more like a high-level idea, very rough and vague. Almost all the actual details are buried in the appendix. I’m not sure if this kind of writing is acceptable, but honestly, it made it super hard for me to understand the method clearly, so hard that I couldn’t even judge whether the approach is reliable or not. I think the appendix should only support the main text, not carry most of the tech
1. The EchoTrace dataset is a valuable resource for future work. It covers diverse text types (public-domain, copyrighted, and unseen books). The segmentation and event summaries make it easy to test new elicitation or membership-inference methods. 2. The proposed RECAP method directly addresses both identified challenges. The results are clear and statistically grounded, showing strong and consistent improvements across four major model families.
In Prefix-Prompting baselines, longer or more detailed prefixes can sometimes lead to stronger verbatim reproduction. It would strengthen the paper if the authors analyzed whether prompt length differences contribute to RECAP’s performance gains.
1.The feedback loop (via Feedback Agent) iteratively refines extractions without injecting excessive external information, reducing false positives. 2.The jailbreaking module effectively circumvents alignment-induced refusals, addressing a key limitation of prior methods like Prefix-Probing and Dynamic Soft Prompting (DSP).The hybrid memorization score filtering balances extraction quality and cost efficiency, addressing the practicality of iterative pipelines. 3.The experiments are comprehensiv
1.The benchmark overrepresents popular works for both public domain and copyrighted categories. This may overestimate RECAP's performance on less mainstream, rarely scraped texts—critical for assessing real-world applicability. 2.Non-training data is limited to 5 books released in 2025, with no diversity in genre or timeframes. This makes it hard to validate RECAP's robustness against false positives across varied non-training scenarios. 3.The jailbreaking module relies on a single static hand-c
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
