Learning User Preferences for Image Generation Model

Wenyi Mo; Ying Ba; Tianyu Zhang; Yalong Bai; Biye Li

arXiv:2508.08220·cs.CV·August 12, 2025

Learning User Preferences for Image Generation Model

Wenyi Mo, Ying Ba, Tianyu Zhang, Yalong Bai, Biye Li

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper introduces a novel approach using multimodal large language models with contrastive preference loss and preference tokens to improve personalized image generation by accurately capturing individual user tastes.

Contribution

It presents a new method that models personalized user preferences through contrastive learning and preference tokens, addressing limitations of static and general preference assumptions.

Findings

01

Outperforms existing methods in preference prediction accuracy

02

Effectively identifies users with similar aesthetic tastes

03

Provides more precise image generation aligned with individual preferences

Abstract

User preference prediction requires a comprehensive and accurate understanding of individual tastes. This includes both surface-level attributes, such as color and style, and deeper content-related aspects, such as themes and composition. However, existing methods typically rely on general human preferences or assume static user profiles, often neglecting individual variability and the dynamic, multifaceted nature of personal taste. To address these limitations, we propose an approach built upon Multimodal Large Language Models, introducing contrastive preference loss and preference tokens to learn personalized user preferences from historical interactions. The contrastive preference loss is designed to effectively distinguish between user ''likes'' and ''dislikes'', while the learnable preference tokens capture shared interest representations among existing users, enabling the model to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
wenyii/learn-user-pref
model· ♡ 3
♡ 3

Datasets

wenyii/learn-user-pref
dataset· 80 dl
80 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques