A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models
Shuyang Wang, Somayeh Moazeni, Diego Klabjan

TL;DR
This paper introduces a sequential optimal learning framework for automated prompt engineering in large language models, utilizing Bayesian regression and the Knowledge-Gradient policy to efficiently identify effective prompts with limited evaluations.
Contribution
It presents a novel feature-based prompt representation combined with an efficient sequential learning method using KG policy, improving prompt optimization under evaluation constraints.
Findings
Outperforms benchmark strategies in instruction induction tasks.
Efficiently explores large prompt feature spaces with limited evaluations.
Demonstrates scalability through mixed-integer second-order cone optimization.
Abstract
Designing effective prompts is essential to guiding large language models (LLMs) toward desired responses. Automated prompt engineering aims to reduce reliance on manual effort by streamlining the design, refinement, and optimization of natural language prompts. This paper proposes an optimal learning framework for automated prompt engineering, designed to sequentially identify effective prompt features while efficiently allocating a limited evaluation budget. We introduce a feature-based method to express prompts, which significantly broadens the search space. Bayesian regression is employed to utilize correlations among similar prompts, accelerating the learning process. To efficiently explore the large space of prompt features for a high quality prompt, we adopt the forward-looking Knowledge-Gradient (KG) policy for sequential optimal learning. The KG policy is computed efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
MethodsADaptive gradient method with the OPTimal convergence rate · Sparse Evolutionary Training
