Rethinking Prompt Optimizers: From Prompt Merits to Optimization
Zixiao Zhu, Hanzhang Zhou, Zijian Feng, Tianjiao Li, Chua Jia Jim Deryl, Mak Lee Onn, Gee Wah Ng, Kezhi Mao

TL;DR
This paper proposes MePO, an interpretable, merit-guided prompt optimizer trained on a lightweight LLM dataset, which improves prompt quality and response accuracy across various models without online optimization.
Contribution
It introduces a novel, model-agnostic prompt optimization method based on explicit quality merits, enhancing interpretability and generalization for diverse LLMs.
Findings
MePO outperforms existing prompt optimization methods across multiple tasks.
It effectively generalizes to both large-scale and lightweight models.
The approach reduces privacy concerns by avoiding online optimization.
Abstract
Prompt optimization (PO) provides a practical way to improve response quality when users lack the time or expertise to manually craft effective prompts. Existing methods typically rely on LLMs' self-generation ability to optimize prompts. However, due to limited downward compatibility, the instruction-heavy prompts generated by advanced LLMs can overwhelm lightweight inference models and degrade response quality, while also lacking interpretability due to implicit optimization. In this work, we rethink prompt optimization through the lens of explicit and interpretable design. We first identify a set of model-agnostic prompt quality merits and empirically validate their effectiveness in enhancing prompt and response quality. We then introduce MePO, a merit-guided, locally deployable prompt optimizer trained on our merit-guided prompt preference dataset generated by a lightweight LLM.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Innovative Human-Technology Interaction · Green IT and Sustainability
MethodsAttention Is All You Need · Label Smoothing · Adam · Linear Layer · Dropout · Residual Connection · Multi-Head Attention · Dense Connections · Layer Normalization · Byte Pair Encoding
