PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing
Peilin Xiong, Junwen Chen, Honghui Yuan, Keiji Yanai

TL;DR
PosBridge introduces a training-free, scalable image editing framework that uses positional embedding transplant to accurately insert objects into scenes, maintaining structural and appearance consistency.
Contribution
The paper presents PosBridge, a novel framework employing positional embedding transplant and Corner Centered Layout for identity-aware image editing without training.
Findings
Outperforms baselines in structural consistency.
Achieves high appearance fidelity.
Offers improved computational efficiency.
Abstract
Localized subject-driven image editing aims to seamlessly integrate user-specified objects into target scenes. As generative models continue to scale, training becomes increasingly costly in terms of memory and computation, highlighting the need for training-free and scalable editing frameworks.To this end, we propose PosBridge an efficient and flexible framework for inserting custom objects. A key component of our method is positional embedding transplant, which guides the diffusion model to faithfully replicate the structural characteristics of reference objects.Meanwhile, we introduce the Corner Centered Layout, which concatenates reference images and the background image as input to the FLUX.1-Fill model. During progressive denoising, positional embedding transplant is applied to guide the noise distribution in the target region toward that of the reference object. In this way,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
