Batch-Instructed Gradient for Prompt Evolution:Systematic Prompt   Optimization for Enhanced Text-to-Image Synthesis

Xinrui Yang; Zhuohan Wang; Anthony Hu

arXiv:2406.08713·cs.AI·June 14, 2024

Batch-Instructed Gradient for Prompt Evolution:Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis

Xinrui Yang, Zhuohan Wang, Anthony Hu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-agent prompt optimization framework that iteratively refines prompts for text-to-image models using feedback mechanisms, improving image quality through systematic prompt evolution.

Contribution

It presents a novel multi-agent system with dynamic prompt refinement and feedback loops, enhancing prompt design for text-to-image synthesis beyond existing direct interaction methods.

Findings

01

Effective prompt refinement improves image quality.

02

Iterative feedback enhances prompt relevance.

03

System components significantly impact performance.

Abstract

Text-to-image models have shown remarkable progress in generating high-quality images from user-provided prompts. Despite this, the quality of these images varies due to the models' sensitivity to human language nuances. With advancements in large language models, there are new opportunities to enhance prompt design for image generation tasks. Existing research primarily focuses on optimizing prompts for direct interaction, while less attention is given to scenarios involving intermediary agents, like the Stable Diffusion model. This study proposes a Multi-Agent framework to optimize input prompts for text-to-image generation models. Central to this framework is a prompt generation mechanism that refines initial queries using dynamic instructions, which evolve through iterative performance feedback. High-quality prompts are then fed into a state-of-the-art text-to-image model. A…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

slark-prime/maai_optimizer
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques

MethodsDiffusion