Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model
Lu Yu, Wei Xiang, Kang Han

TL;DR
Edit-DiffNeRF introduces a novel framework combining frozen diffusion models, a delta editing module, and NeRF to enable fine-grained, semantically consistent 3D scene editing guided by text instructions, improving alignment accuracy.
Contribution
The paper proposes a new method for editing 3D scenes using a frozen diffusion model and a delta module, addressing cross-view inconsistency and enabling precise semantic modifications.
Findings
Achieves 25% improvement in edit-text alignment accuracy
Effectively edits real-world 3D scenes
Ensures multi-view semantic consistency
Abstract
Recent research has demonstrated that the combination of pretrained diffusion models with neural radiance fields (NeRFs) has emerged as a promising approach for text-to-3D generation. Simply coupling NeRF with diffusion models will result in cross-view inconsistency and degradation of stylized view syntheses. To address this challenge, we propose the Edit-DiffNeRF framework, which is composed of a frozen diffusion model, a proposed delta module to edit the latent semantic space of the diffusion model, and a NeRF. Instead of training the entire diffusion for each scene, our method focuses on editing the latent semantic space in frozen pretrained diffusion models by the delta module. This fundamental change to the standard diffusion framework enables us to make fine-grained modifications to the rendered views and effectively consolidate these instructions in a 3D scene via NeRF training.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Human Pose and Action Recognition
MethodsDiffusion
