Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
Bo Yin, Qi Li, Runpeng Yu, Xinchao Wang

TL;DR
This paper introduces RePro, a method to detect whether training prompts for language models were original or refined by an external refiner, using stable shifts in model output distributions for dataset governance and dispute resolution.
Contribution
The paper formalizes the Refinement Provenance Inference task and proposes RePro, a logit-based framework that effectively detects refined prompts across models without access to training data.
Findings
RePro achieves strong performance in provenance inference.
RePro transfers well across different refiners.
Detection relies on stable distribution shifts rather than rewrite artifacts.
Abstract
Instruction tuning increasingly relies on LLM-based prompt refinement, where prompts in the training corpus are selectively rewritten by an external refiner to improve clarity and instruction alignment. This motivates an instance-level audit problem: for a fine-tuned model and a training prompt-response pair, can we infer whether the model was trained on the original prompt or its LLM-refined version within a mixed corpus? This matters for dataset governance and dispute resolution when training data are contested. However, it is non-trivial in practice: refined and raw instances are interleaved in the training corpus with unknown, source-dependent mixture ratios, making it harder to develop provenance methods that generalize across models and training setups. In this paper, we formalize this audit task as Refinement Provenance Inference (RPI) and show that prompt refinement yields…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science
