Prompt Expansion for Adaptive Text-to-Image Generation

Siddhartha Datta; Alexander Ku; Deepak Ramachandran; Peter Anderson

arXiv:2312.16720·cs.CV·December 29, 2023·1 cites

Prompt Expansion for Adaptive Text-to-Image Generation

Siddhartha Datta, Alexander Ku, Deepak Ramachandran, Peter Anderson

PDF

Open Access 1 Video

TL;DR

This paper introduces a Prompt Expansion framework that enhances text-to-image generation by producing diverse and high-quality images with less user effort, validated through human evaluations.

Contribution

It presents a novel Prompt Expansion model that optimizes expanded prompts to improve image diversity and quality in text-to-image generation.

Findings

01

Expanded prompts lead to more diverse images

02

Generated images are more aesthetically pleasing

03

Human evaluations favor the proposed method

Abstract

Text-to-image generation models are powerful but difficult to use. Users craft specific prompts to get better images, though the images can be repetitive. This paper proposes a Prompt Expansion framework that helps users generate high-quality, diverse images with less effort. The Prompt Expansion model takes a text query as input and outputs a set of expanded text prompts that are optimized such that when passed to a text-to-image model, generates a wider variety of appealing images. We conduct a human evaluation study that shows that images generated through Prompt Expansion are more aesthetically pleasing and diverse than those generated by baseline methods. Overall, this paper presents a novel and effective approach to improving the text-to-image generation experience.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Prompt Expansion for Adaptive Text-to-Image Generation· underline

Taxonomy

TopicsVideo Analysis and Summarization · Image Retrieval and Classification Techniques · Computer Graphics and Visualization Techniques

MethodsSparse Evolutionary Training