Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior

Bo Yin; Qi Li; Runpeng Yu; Xinchao Wang

arXiv:2601.01966·cs.LG·January 6, 2026

Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior

Bo Yin, Qi Li, Runpeng Yu, Xinchao Wang

PDF

Open Access

TL;DR

This paper introduces RePro, a method to detect whether training prompts for language models were original or refined by an external refiner, using stable shifts in model output distributions for dataset governance and dispute resolution.

Contribution

The paper formalizes the Refinement Provenance Inference task and proposes RePro, a logit-based framework that effectively detects refined prompts across models without access to training data.

Findings

01

RePro achieves strong performance in provenance inference.

02

RePro transfers well across different refiners.

03

Detection relies on stable distribution shifts rather than rewrite artifacts.

Abstract

Instruction tuning increasingly relies on LLM-based prompt refinement, where prompts in the training corpus are selectively rewritten by an external refiner to improve clarity and instruction alignment. This motivates an instance-level audit problem: for a fine-tuned model and a training prompt-response pair, can we infer whether the model was trained on the original prompt or its LLM-refined version within a mixed corpus? This matters for dataset governance and dispute resolution when training data are contested. However, it is non-trivial in practice: refined and raw instances are interleaved in the training corpus with unknown, source-dependent mixture ratios, making it harder to develop provenance methods that generalize across models and training setups. In this paper, we formalize this audit task as Refinement Provenance Inference (RPI) and show that prompt refinement yields…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science