Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases
Elad Levi, Eli Brosh, Matan Friedmann

TL;DR
This paper presents a novel automatic prompt calibration method that iteratively refines prompts by generating synthetic boundary cases, improving performance on real-world tasks without extensive labeled data.
Contribution
Introduces a modular, synthetic boundary case-based prompt calibration approach that enhances prompt optimization for LLMs, outperforming existing methods with limited annotated samples.
Findings
Outperforms state-of-the-art prompt engineering methods
Effective with limited annotated data
Modular system adaptable to various tasks
Abstract
Prompt engineering is a challenging and important task due to the high sensitivity of Large Language Models (LLMs) to the given prompt and the inherent ambiguity of a textual task instruction. Automatic prompt engineering is essential to achieve optimized performance from LLMs. Recent studies have demonstrated the capabilities of LLMs to automatically conduct prompt engineering by employing a meta-prompt that incorporates the outcomes of the last trials and proposes an improved prompt. However, this requires a high-quality benchmark to compare different prompts, which is difficult and expensive to acquire in many real-world use cases. In this work, we introduce a new method for automatic prompt engineering, using a calibration process that iteratively refines the prompt to the user intent. During the optimization process, the system jointly generates synthetic data of boundary use cases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Model Reduction and Neural Networks · Control Systems and Identification
