Parallel Rescaling: Rebalancing Consistency Guidance for Personalized Diffusion Models
JungWoo Chae, Jiyoon Kim, Sangheum Hwang

TL;DR
This paper introduces a parallel rescaling technique for personalized diffusion models that improves prompt alignment and visual fidelity without extra training data, addressing overfitting and misalignment issues in existing methods.
Contribution
It proposes a novel parallel rescaling approach that decomposes and adjusts guidance signals, enhancing personalization stability and accuracy without additional data or annotations.
Findings
Improved prompt alignment over baseline methods
Enhanced visual fidelity on stylized prompts
More stable personalization across diverse inputs
Abstract
Personalizing diffusion models to specific users or concepts remains challenging, particularly when only a few reference images are available. Existing methods such as DreamBooth and Textual Inversion often overfit to limited data, causing misalignment between generated images and text prompts when attempting to balance identity fidelity with prompt adherence. While Direct Consistency Optimization (DCO) with its consistency-guided sampling partially alleviates this issue, it still struggles with complex or stylized prompts. In this paper, we propose a parallel rescaling technique for personalized diffusion models. Our approach explicitly decomposes the consistency guidance signal into parallel and orthogonal components relative to classifier free guidance (CFG). By rescaling the parallel component, we minimize disruptive interference with CFG while preserving the subject's identity.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications
