Batch-Instructed Gradient for Prompt Evolution:Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis
Xinrui Yang, Zhuohan Wang, Anthony Hu

TL;DR
This paper introduces a multi-agent prompt optimization framework that iteratively refines prompts for text-to-image models using feedback mechanisms, improving image quality through systematic prompt evolution.
Contribution
It presents a novel multi-agent system with dynamic prompt refinement and feedback loops, enhancing prompt design for text-to-image synthesis beyond existing direct interaction methods.
Findings
Effective prompt refinement improves image quality.
Iterative feedback enhances prompt relevance.
System components significantly impact performance.
Abstract
Text-to-image models have shown remarkable progress in generating high-quality images from user-provided prompts. Despite this, the quality of these images varies due to the models' sensitivity to human language nuances. With advancements in large language models, there are new opportunities to enhance prompt design for image generation tasks. Existing research primarily focuses on optimizing prompts for direct interaction, while less attention is given to scenarios involving intermediary agents, like the Stable Diffusion model. This study proposes a Multi-Agent framework to optimize input prompts for text-to-image generation models. Central to this framework is a prompt generation mechanism that refines initial queries using dynamic instructions, which evolve through iterative performance feedback. High-quality prompts are then fed into a state-of-the-art text-to-image model. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques
MethodsDiffusion
