Autonomous Prompt Engineering in Large Language Models
Daan Kepel, Konstantina Valogianni

TL;DR
This paper presents APET, a toolbox enabling GPT-4 to autonomously optimize prompts using advanced strategies, significantly improving performance on various tasks without external data, marking a major advancement in autonomous AI systems.
Contribution
Introduction of APET, a novel framework allowing GPT-4 to autonomously perform prompt engineering, enhancing task performance through sophisticated strategies without external data.
Findings
Improved task performance in Word Sorting (+4.4%) and Geometric Shapes (+6.8%)
Demonstrated autonomous prompt optimization without external data
Identified challenges in complex tasks like Checkmate in One (-14.8%)
Abstract
Prompt engineering is a crucial yet challenging task for optimizing the performance of large language models (LLMs) on customized tasks. This pioneering research introduces the Automatic Prompt Engineering Toolbox (APET), which enables GPT-4 to autonomously apply prompt engineering techniques. By leveraging sophisticated strategies such as Expert Prompting, Chain of Thought, and Tree of Thoughts, APET empowers GPT-4 to dynamically optimize prompts, resulting in substantial improvements in tasks like Word Sorting (4.4% increase) and Geometric Shapes (6.8% increase). Despite encountering challenges in complex tasks such as Checkmate in One (-14.8%), these findings demonstrate the transformative potential of APET in automating complex prompt optimization processes without the use of external data. Overall, this research represents a significant leap in AI development, presenting a robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections
