A Neural Space-Time Representation for Text-to-Image Personalization
Yuval Alaluf, Elad Richardson, Gal Metzer, Daniel Cohen-Or

TL;DR
This paper introduces a novel space-time neural representation for text-to-image personalization, enabling high-quality, controllable image generation with a compact model that does not require fine-tuning the generative model.
Contribution
It proposes a new implicit space-time concept representation using a neural mapper, improving personalization quality and control without fine-tuning the generative model.
Findings
Enhanced visual fidelity and convergence in personalized image generation.
Ability to control concept reconstruction and editability with a single model.
Outperforms existing methods in generating high-quality, personalized images.
Abstract
A key aspect of text-to-image personalization methods is the manner in which the target concept is represented within the generative process. This choice greatly affects the visual fidelity, downstream editability, and disk space needed to store the learned concept. In this paper, we explore a new text-conditioning space that is dependent on both the denoising process timestep (time) and the denoising U-Net layers (space) and showcase its compelling properties. A single concept in the space-time representation is composed of hundreds of vectors, one for each combination of time and space, making this space challenging to optimize directly. Instead, we propose to implicitly represent a concept in this space by optimizing a small neural mapper that receives the current time and space parameters and outputs the matching token embedding. In doing so, the entire personalized concept is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Image Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Max Pooling · Convolution · U-Net
