3D-Consistent Multi-View Editing by Correspondence Guidance
Josef Bengtson, David Nilsson, Dong In Lee, Yaroslava Lochman, Fredrik Kahl

TL;DR
This paper introduces a training-free guidance framework that enforces multi-view consistency during image editing, significantly improving 3D consistency and detail preservation in multi-view and 3D scene editing tasks.
Contribution
It proposes a novel consistency loss guiding denoising for multi-view editing, enabling coherent edits without additional training and supporting various editing setups.
Findings
Enhanced 3D consistency over existing methods
Supports both dense and sparse multi-view editing
Achieves high-quality Gaussian splat editing with detailed fidelity
Abstract
Recent advancements in diffusion and flow models have greatly improved text-based image editing, yet methods that edit images independently often produce geometrically and photometrically inconsistent results across different views of the same scene. Such inconsistencies are particularly problematic for editing of 3D representations such as NeRFs or Gaussian splat models. We propose a training-free guidance framework that enforces multi-view consistency during the image editing process. The key idea is that corresponding points should look similar after editing. To achieve this, we introduce a consistency loss that guides the denoising process toward coherent edits. The framework is flexible and can be combined with widely varying image editing methods, supporting both dense and sparse multi-view editing setups. Experimental results show that our approach significantly improves 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging
