SerialGen: Personalized Image Generation by First Standardization Then Personalization
Cong Xie, Han Zou, Ruiqi Yu, Yan Zhang, Zhenpeng Zhan

TL;DR
SerialGen is a two-stage personalized image generation framework that standardizes reference images before generating consistent, text-controllable human characters, improving appearance fidelity and prompt responsiveness.
Contribution
The paper introduces SerialGen, a novel serial generation framework with a standardization stage and two modules, enhancing personalized image consistency and controllability.
Findings
Produces images with high appearance consistency
Accurately responds to diverse text prompts
Enhances serial image generation quality
Abstract
In this work, we are interested in achieving both high text controllability and whole-body appearance consistency in the generation of personalized human characters. We propose a novel framework, named SerialGen, which is a serial generation method consisting of two stages: first, a standardization stage that standardizes reference images, and then a personalized generation stage based on the standardized reference. Furthermore, we introduce two modules aimed at enhancing the standardization process. Our experimental results validate the proposed framework's ability to produce personalized images that faithfully recover the reference image's whole-body appearance while accurately responding to a wide range of text prompts. Through thorough analysis, we highlight the critical contribution of the proposed serial generation method and standardization model, evidencing enhancements in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Image Retrieval and Classification Techniques
