Efficient Prompt Optimization Through the Lens of Best Arm Identification
Chengshuai Shi, Kun Yang, Zihan Chen, Jundong Li, Jing Yang, Cong, Shen

TL;DR
This paper introduces TRIPLE, a novel framework for prompt selection in large language models that optimizes the process under explicit budget constraints by leveraging techniques from multi-armed bandit best arm identification.
Contribution
It establishes a new connection between prompt optimization and fixed-budget best arm identification, enabling more efficient prompt selection under cost constraints.
Findings
TRIPLE outperforms baseline methods in various tasks.
It effectively incorporates budget constraints into prompt selection.
Extensions of TRIPLE improve few-shot prompt example selection.
Abstract
The remarkable instruction-following capability of large language models (LLMs) has sparked a growing interest in automatically finding good prompts, i.e., prompt optimization. Most existing works follow the scheme of selecting from a pre-generated pool of candidate prompts. However, these designs mainly focus on the generation strategy, while limited attention has been paid to the selection method. Especially, the cost incurred during the selection (e.g., accessing LLM and evaluating the responses) is rarely explicitly considered. To overcome this limitation, this work provides a principled framework, TRIPLE, to efficiently perform prompt selection under an explicit budget constraint. TRIPLE is built on a novel connection established between prompt optimization and fixed-budget best arm identification (BAI-FB) in multi-armed bandits (MAB); thus, it is capable of leveraging the rich…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRobot Manipulation and Learning · Intelligent Tutoring Systems and Adaptive Learning
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Focus · Cosine Annealing · Attention Is All You Need · Dropout · Linear Layer · Linear Warmup With Cosine Annealing · Dense Connections · Softmax · Weight Decay
