Personalizing Text-to-Image Generation via Aesthetic Gradients

Victor Gallego

arXiv:2209.12330·cs.CV·September 27, 2022·5 cites

Personalizing Text-to-Image Generation via Aesthetic Gradients

Victor Gallego

PDF

Open Access 1 Repo

TL;DR

This paper introduces aesthetic gradients, a technique to customize text-to-image generation by steering the process towards user-defined aesthetics, validated through experiments with stable diffusion models and aesthetic datasets.

Contribution

It presents a novel method for personalizing diffusion-based image generation using aesthetic gradients guided by user-provided images.

Findings

01

Effective personalization of image aesthetics demonstrated

02

Qualitative and quantitative validation with stable diffusion models

03

Code released for reproducibility

Abstract

This work proposes aesthetic gradients, a method to personalize a CLIP-conditioned diffusion model by guiding the generative process towards custom aesthetics defined by the user from a set of images. The approach is validated with qualitative and quantitative experiments, using the recent stable diffusion model and several aesthetically-filtered datasets. Code is released at https://github.com/vicgalle/stable-diffusion-aesthetic-gradients

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vicgalle/stable-diffusion-aesthetic-gradients
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Aesthetic Perception and Analysis · Video Analysis and Summarization

MethodsDiffusion