A Sequential Optimal Learning Approach to Automated Prompt Engineering   in Large Language Models

Shuyang Wang; Somayeh Moazeni; Diego Klabjan

arXiv:2501.03508·cs.CL·January 8, 2025

A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models

Shuyang Wang, Somayeh Moazeni, Diego Klabjan

PDF

Open Access

TL;DR

This paper introduces a sequential optimal learning framework for automated prompt engineering in large language models, utilizing Bayesian regression and the Knowledge-Gradient policy to efficiently identify effective prompts with limited evaluations.

Contribution

It presents a novel feature-based prompt representation combined with an efficient sequential learning method using KG policy, improving prompt optimization under evaluation constraints.

Findings

01

Outperforms benchmark strategies in instruction induction tasks.

02

Efficiently explores large prompt feature spaces with limited evaluations.

03

Demonstrates scalability through mixed-integer second-order cone optimization.

Abstract

Designing effective prompts is essential to guiding large language models (LLMs) toward desired responses. Automated prompt engineering aims to reduce reliance on manual effort by streamlining the design, refinement, and optimization of natural language prompts. This paper proposes an optimal learning framework for automated prompt engineering, designed to sequentially identify effective prompt features while efficiently allocating a limited evaluation budget. We introduce a feature-based method to express prompts, which significantly broadens the search space. Bayesian regression is employed to utilize correlations among similar prompts, accelerating the learning process. To efficiently explore the large space of prompt features for a high quality prompt, we adopt the forward-looking Knowledge-Gradient (KG) policy for sequential optimal learning. The KG policy is computed efficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsADaptive gradient method with the OPTimal convergence rate · Sparse Evolutionary Training