PromptIQ: Who Cares About Prompts? Let System Handle It -- A Component-Aware Framework for T2I Generation

Nisan Chhetri; Arpan Sainju

arXiv:2505.06467·cs.CV·May 13, 2025

PromptIQ: Who Cares About Prompts? Let System Handle It -- A Component-Aware Framework for T2I Generation

Nisan Chhetri, Arpan Sainju

PDF

Open Access

TL;DR

PromptIQ is an automated framework that refines prompts and evaluates image quality in text-to-image generation, addressing structural issues and reducing the need for manual prompt tuning.

Contribution

It introduces a novel Component-Aware Similarity metric and an iterative process for prompt refinement and quality assessment in T2I models.

Findings

01

Significantly improves image quality in T2I generation.

02

Enhances evaluation accuracy over existing metrics.

03

Reduces user effort in prompt engineering.

Abstract

Generating high-quality images without prompt engineering expertise remains a challenge for text-to-image (T2I) models, which often misinterpret poorly structured prompts, leading to distortions and misalignments. While humans easily recognize these flaws, metrics like CLIP fail to capture structural inconsistencies, exposing a key limitation in current evaluation methods. To address this, we introduce PromptIQ, an automated framework that refines prompts and assesses image quality using our novel Component-Aware Similarity (CAS) metric, which detects and penalizes structural errors. Unlike conventional methods, PromptIQ iteratively generates and evaluates images until the user is satisfied, eliminating trial-and-error prompt tuning. Our results show that PromptIQ significantly improves generation quality and evaluation accuracy, making T2I models more accessible for users with little…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Image Processing Techniques

MethodsContrastive Language-Image Pre-training