CAP: Evaluation of Persuasive and Creative Image Generation
Aysan Aghazadeh, Adriana Kovashka

TL;DR
This paper introduces three novel metrics to evaluate creativity, alignment, and persuasiveness in generated advertisement images, addressing gaps in current evaluation methods and proposing improvements for Text-to-Image models.
Contribution
The paper presents new evaluation metrics for assessing creativity, alignment, and persuasiveness in generated images and proposes an approach to improve T2I model performance in these aspects.
Findings
Current T2I models struggle with implicit prompts.
Models have difficulty generating creative and persuasive images.
Proposed approach enhances alignment, creativity, and persuasiveness.
Abstract
We address the task of advertisement image generation and introduce three evaluation metrics to assess Creativity, prompt Alignment, and Persuasiveness (CAP) in generated advertisement images. Despite recent advancements in Text-to-Image (T2I) generation and their performance in generating high-quality images for explicit descriptions, evaluating these models remains challenging. Existing evaluation methods focus largely on assessing alignment with explicit, detailed descriptions, but evaluating alignment with visually implicit prompts remains an open problem. Additionally, creativity and persuasiveness are essential qualities that enhance the effectiveness of advertisement images, yet are seldom measured. To address this, we propose three novel metrics for evaluating the creativity, alignment, and persuasiveness of generated images. Our findings reveal that current T2I models struggle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis
MethodsFocus
