CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Han He, Qianchu Liu, Lei Xu, Chaitanya Shivade, Yi Zhang, Sundararajan, Srinivasan, Katrin Kirchhoff

TL;DR
CriSPO is a novel multi-aspect critique-suggestion framework that enhances prompt optimization for text generation tasks, leading to significant improvements across multiple metrics in summarization and question-answering datasets.
Contribution
The paper introduces CriSPO, a critique-suggestion guided approach with an automatic suffix tuning extension for multi-metric prompt optimization in generative language models.
Findings
Achieves 3-4% ROUGE score improvement in summarization.
Substantially improves various metrics in QA tasks.
Effective multi-aspect prompt refinement demonstrated across multiple datasets.
Abstract
Existing automatic prompt engineering methods are typically designed for discriminative tasks, where new task prompts are iteratively refined with limited feedback from a single metric reflecting a single aspect. However, these approaches are suboptimal for generative tasks, which require more nuanced guidance beyond a single numeric metric to improve the prompt and optimize multiple aspects of the generated text. To address these challenges, we propose a novel multi-aspect Critique-Suggestion-guided automatic Prompt Optimization (CriSPO) approach. CriSPO introduces a critique-suggestion module as its core component. This module spontaneously discovers aspects, and compares generated and reference texts across these aspects, providing specific suggestions for prompt modification. These clear critiques and actionable suggestions guide a receptive optimizer module to make more substantial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research
