ViPer: Visual Personalization of Generative Models via Individual   Preference Learning

Sogand Salehi; Mahdi Shafiei; Teresa Yeo; Roman Bachmann; Amir Zamir

arXiv:2407.17365·cs.CV·July 25, 2024

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

Sogand Salehi, Mahdi Shafiei, Teresa Yeo, Roman Bachmann, Amir Zamir

PDF

2 Models

TL;DR

ViPer introduces a method to personalize generative image models by learning individual preferences through user comments and guiding image generation accordingly, improving alignment with personal tastes.

Contribution

The paper presents a novel approach that captures user preferences via comments and uses language models to guide personalized image generation, reducing manual prompt engineering.

Findings

01

Generated images align better with individual preferences.

02

User satisfaction with personalized images increases.

03

Method outperforms baseline in preference alignment.

Abstract

Different users find different images generated for the same prompt desirable. This gives rise to personalized image generation which involves creating images aligned with an individual's visual preference. Current generative models are, however, unpersonalized, as they are tuned to produce outputs that appeal to a broad audience. Using them to generate images aligned with individual users relies on iterative manual prompt engineering by the user which is inefficient and undesirable. We propose to personalize the image generation process by first capturing the generic preferences of the user in a one-time process by inviting them to comment on a small selection of images, explaining why they like or dislike each. Based on these comments, we infer a user's structured liked and disliked visual attributes, i.e., their visual preference, using a large language model. These attributes are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.