BRIDGE: Background Routing and Isolated Discrete Gating for Coarse-Mask Local Editing
Peilin Xiong, Honghui Yuan, Junwen Chen, and Keiji Yanai

TL;DR
BRIDGE introduces a novel approach for coarse-mask local image editing that avoids mask shape bias by separating support construction from the editing backbone and employing a learnable token routing gate.
Contribution
It proposes a new framework with background support outside the backbone and a discrete geometric gate for flexible, bias-free local image editing.
Findings
BRIDGE improves local editing metrics on BRIDGE-Bench.
Zero-shot results show competitive alignment and source preservation.
The routing module remains compact at 13.31M parameters.
Abstract
Coarse-mask local image editing asks a model to modify a user-indicated region while preserving the surrounding scene. In practice, however, rough masks often become unintended shape priors: instead of serving as flexible edit support, the mask can pull the generated object toward its accidental boundary. We study this failure as mask-shape bias and frame the task through a Two-Zone Constraint, where the background should remain stable while the editable region should follow the instruction without being forced to inherit the mask contour. BRIDGE addresses this setting by keeping masks outside the DiT backbone for support construction and blending, avoiding DiT-internal mask injection and copied control branches. It uses BridgePath generation, where a Main Path preserves background context and a Subject Path generates editable content from independent noise. Motivated by a diagnostic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
