System Prompt Optimization with Meta-Learning
Yumin Choi, Jinheon Baek, Sung Ju Hwang

TL;DR
This paper introduces a meta-learning approach for optimizing system prompts in large language models, aiming for robustness and transferability across tasks and domains, and demonstrates improved generalization and rapid adaptation.
Contribution
It presents the first bilevel optimization framework for system prompt design using meta-learning, addressing transferability and robustness across diverse tasks.
Findings
Optimized system prompts generalize well to unseen datasets and domains.
The approach enables faster adaptation to new tasks with fewer optimization steps.
Improved performance over baseline prompt optimization methods.
Abstract
Large Language Models (LLMs) have shown remarkable capabilities, with optimizing their input prompts playing a pivotal role in maximizing their performance. However, while LLM prompts consist of both the task-agnostic system prompts and task-specific user prompts, existing work on prompt optimization has focused on user prompts specific to individual queries or tasks, and largely overlooked the system prompt that is, once optimized, applicable across different tasks and domains. Motivated by this, we introduce the novel problem of bilevel system prompt optimization, whose objective is to design system prompts that are robust to diverse user prompts and transferable to unseen tasks. To tackle this problem, we then propose a meta-learning framework, which meta-learns the system prompt by optimizing it over various user prompts across multiple datasets, while simultaneously updating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning
