Evolutionary Optimization Trumps Adam Optimization on Embedding Space Exploration
Dom\'icio Pereira Neto, Jo\~ao Correia, Penousal Machado

TL;DR
This paper demonstrates that a gradient-free evolutionary optimizer, sep-CMA-ES, outperforms Adam in prompt-embedding search for diffusion models, enhancing image quality and alignment without model fine-tuning.
Contribution
It introduces the use of sep-CMA-ES for prompt-embedding optimization in diffusion models, showing superior performance over Adam in various trade-offs.
Findings
sep-CMA-ES achieves higher objective scores than Adam across prompts.
Evolutionary optimization improves aesthetics-alignment trade-offs.
Resource usage for sep-CMA-ES is comparable or better than Adam.
Abstract
Deep diffusion models have revolutionized image generation by producing high-quality outputs. However, achieving specific objectives with these models often requires costly adaptations such as fine-tuning, which can be resource-intensive and time-consuming. An alternative approach is inference-time control, which involves optimizing the prompt embeddings to guide the generation process without altering the model weights. We explore prompt-embedding search optimization for the Stable Diffusion XL Turbo model, comparing a gradient-free evolutionary approach, the Separable Covariance Matrix Adaptation Evolution Strategy (sep-CMA-ES), against the widely used gradient-based optimizer Adaptive Moment Estimation (Adam). Candidate images are evaluated by a weighted objective that combines LAION Aesthetic Predictor V2 and CLIPScore, enabling explicit trade-offs between aesthetic quality and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
