TL;DR
SPARKE introduces a scalable, prompt-aware diversity guidance method for diffusion models that enhances diversity in generated images while maintaining computational efficiency, especially in large-scale prompt settings.
Contribution
The paper proposes SPARKE, a novel entropy-based diversity guidance method that reduces computational complexity from O(n^3) to O(n), enabling prompt-aware diversity control in large-scale diffusion model generation.
Findings
Improves prompt-aware diversity in diffusion model outputs.
Reduces entropy computation complexity from cubic to linear scale.
Demonstrates effectiveness on multiple text-to-image diffusion models.
Abstract
Diffusion models have demonstrated remarkable success in high-fidelity image synthesis and prompt-guided generative modeling. However, ensuring adequate diversity in generated samples of prompt-guided diffusion models remains a challenge, particularly when the prompts span a broad semantic spectrum and the diversity of generated data needs to be evaluated in a prompt-aware fashion across semantically similar prompts. Recent methods have introduced guidance via diversity measures to encourage more varied generations. In this work, we extend the diversity measure-based approaches by proposing the Scalable Prompt-Aware R\'eny Kernel Entropy Diversity Guidance (SPARKE) method for prompt-aware diversity guidance. SPARKE utilizes conditional entropy for diversity guidance, which dynamically conditions diversity measurement on similar prompts and enables prompt-aware diversity control. While…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
MethodsFocus · Diffusion
