Learning from Prompt itself: the Hierarchical Attribution Prompt Optimization
Dongyu Chen, Jian Ma, Xianpeng Zhang, Lei Zhang, Haonan Lu, Chen Chen, Chuangchuang Wang, Kai Tang

TL;DR
The paper introduces HAPO, a hierarchical prompt optimization framework that improves prompt design for large language models by reducing manual effort, preventing prompt drift, and enhancing interpretability through dynamic attribution and semantic editing.
Contribution
HAPO presents a novel hierarchical optimization approach with dynamic attribution, semantic-unit editing, and multimodal support, advancing automated prompt engineering.
Findings
HAPO outperforms existing prompt optimization methods in efficiency.
HAPO maintains task performance without prompt drift.
HAPO is effective in multimodal and complex task scenarios.
Abstract
Optimization is fundamental across numerous disciplines, typically following an iterative process of refining an initial solution to enhance performance. This principle is equally critical in prompt engineering, where designing effective prompts for large language models constitutes a complex optimization challenge. A structured optimization approach requires automated or semi-automated procedures to develop improved prompts, thereby reducing manual effort, improving performance, and yielding an interpretable process. However, current prompt optimization methods often induce prompt drift, where new prompts fix prior failures but impair performance on previously successful tasks. Additionally, generating prompts from scratch can compromise interpretability. To address these limitations, this study proposes the Hierarchical Attribution Prompt Optimization (HAPO) framework, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Parallel Computing and Optimization Techniques
