IP-Prompter: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Yuxin Zhang, Minyan Luo, Weiming Dong, Xiao Yang, Haibin Huang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu

TL;DR
IP-Prompter introduces a training-free, dynamic visual prompting method that enables theme-specific image generation by directly leveraging reference images, improving diversity, consistency, and style alignment without additional training.
Contribution
The paper proposes a novel training-free visual prompting technique with dynamic optimization for theme-specific image generation, bypassing the need for fine-tuning and enhancing flexibility.
Findings
Outperforms state-of-the-art personalization methods in quality and consistency.
Enables diverse applications like story generation and style transfer.
Maintains character identity and style coherence effectively.
Abstract
The stories and characters that captivate us as we grow up shape unique fantasy worlds, with images serving as the primary medium for visually experiencing these realms. Personalizing generative models through fine-tuning with theme-specific data has become a prevalent approach in text-to-image generation. However, unlike object customization, which focuses on learning specific objects, theme-specific generation encompasses diverse elements such as characters, scenes, and objects. Such diversity also introduces a key challenge: how to adaptively generate multi-character, multi-concept, and continuous theme-specific images (TSI). Moreover, fine-tuning approaches often come with significant computational overhead, time costs, and risks of overfitting. This paper explores a fundamental question: Can image generation models directly leverage images as contextual input, similarly to how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Artificial Intelligence in Games · Digital Storytelling and Education
