A Unified Evaluation-Instructed Framework for Query-Dependent Prompt Optimization
Ke Chen, Yifeng Wang, Hassan Almosapeeh, Haohan Wang

TL;DR
This paper introduces a unified, performance-oriented framework for query-dependent prompt optimization that uses a learned evaluator to guide prompt rewriting, resulting in more stable and interpretable improvements across various tasks.
Contribution
It proposes a comprehensive prompt evaluation framework and an execution-free evaluator that guides query-dependent prompt rewriting, addressing limitations of prior static and black-box methods.
Findings
Evaluator achieves high accuracy in predicting prompt performance.
Optimization outperforms static and query-dependent baselines on multiple datasets.
Framework provides stable, interpretable, and model-agnostic prompt improvements.
Abstract
Most prompt-optimization methods refine a single static template, making them ineffective in complex and dynamic user scenarios. Existing query-dependent approaches rely on unstable textual feedback or black-box reward models, providing weak and uninterpretable optimization signals. More fundamentally, prompt quality itself lacks a unified, systematic definition, resulting in fragmented and unreliable evaluation signals. Our approach first establishes a performance-oriented, systematic, and comprehensive prompt evaluation framework. Furthermore, we develop and finetune an execution-free evaluator that predicts multi-dimensional quality scores directly from text. The evaluator then instructs a metric-aware optimizer that diagnoses failure modes and rewrites prompts in an interpretable, query-dependent manner. Our evaluator achieves the strongest accuracy in predicting prompt performance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Quality and Management · Information Retrieval and Search Behavior
