Editing 3D Scenes via Text Prompts without Retraining
Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding,, Shuchang Zhou, Ming-Hsuan Yang

TL;DR
This paper introduces DN2N, a text-driven 3D scene editing method that enables universal editing capabilities without retraining, using off-the-shelf 2D image editing models and a filtering process to maintain 3D consistency.
Contribution
The paper presents a novel approach that allows direct, retraining-free editing of 3D scenes via text prompts, leveraging cross-view regularization and noise removal techniques.
Findings
Enables multiple editing types like appearance, weather, material, and style transfer.
Achieves generalization across different scenes without scene-specific retraining.
Maintains 3D consistency during editing through filtering and regularization.
Abstract
Numerous diffusion models have recently been applied to image synthesis and editing. However, editing 3D scenes is still in its early stages. It poses various challenges, such as the requirement to design specific methods for different editing types, retraining new models for various 3D scenes, and the absence of convenient human interaction during editing. To tackle these issues, we introduce a text-driven editing method, termed DN2N, which allows for the direct acquisition of a NeRF model with universal editing capabilities, eliminating the requirement for retraining. Our method employs off-the-shelf text-based editing models of 2D images to modify the 3D scene images, followed by a filtering process to discard poorly edited images that disrupt 3D consistency. We then consider the remaining inconsistency as a problem of removing noise perturbation, which can be solved by generating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
MethodsDiffusion
