Customizing Text-to-Image Models with a Single Image Pair

Maxwell Jones; Sheng-Yu Wang; Nupur Kumari; David Bau; Jun-Yan Zhu

arXiv:2405.01536·cs.CV·October 29, 2024

Customizing Text-to-Image Models with a Single Image Pair

Maxwell Jones, Sheng-Yu Wang, Nupur Kumari, David Bau, Jun-Yan Zhu

PDF

Open Access

TL;DR

This paper introduces Pair Customization, a method that learns stylistic differences from a single image pair to customize text-to-image models, enabling style transfer without overfitting to specific content.

Contribution

It proposes a novel approach that captures stylistic differences from one image pair and applies them to generate styled images, avoiding overfitting and preserving content.

Findings

01

Effective style learning from a single image pair

02

Avoids overfitting to specific image content

03

Improves style transfer quality in text-to-image models

Abstract

Art reinterpretation is the practice of creating a variation of a reference work, making a paired artwork that exhibits a distinct artistic style. We ask if such an image pair can be used to customize a generative model to capture the demonstrated stylistic difference. We propose Pair Customization, a new customization method that learns stylistic difference from a single image pair and then applies the acquired style to the generation process. Unlike existing methods that learn to mimic a single concept from a collection of images, our method captures the stylistic difference between paired images. This allows us to apply a stylistic change without overfitting to the specific image content in the examples. To address this new task, we employ a joint optimization method that explicitly separates the style and content into distinct LoRA weight spaces. We optimize these style and content…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Modeling in Geospatial Applications

MethodsDiffusion