Editing 3D Scenes via Text Prompts without Retraining

Shuangkang Fang; Yufeng Wang; Yi Yang; Yi-Hsuan Tsai; Wenrui Ding,; Shuchang Zhou; Ming-Hsuan Yang

arXiv:2309.04917·cs.CV·December 1, 2023·1 cites

Editing 3D Scenes via Text Prompts without Retraining

Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding,, Shuchang Zhou, Ming-Hsuan Yang

PDF

Open Access

TL;DR

This paper introduces DN2N, a text-driven 3D scene editing method that enables universal editing capabilities without retraining, using off-the-shelf 2D image editing models and a filtering process to maintain 3D consistency.

Contribution

The paper presents a novel approach that allows direct, retraining-free editing of 3D scenes via text prompts, leveraging cross-view regularization and noise removal techniques.

Findings

01

Enables multiple editing types like appearance, weather, material, and style transfer.

02

Achieves generalization across different scenes without scene-specific retraining.

03

Maintains 3D consistency during editing through filtering and regularization.

Abstract

Numerous diffusion models have recently been applied to image synthesis and editing. However, editing 3D scenes is still in its early stages. It poses various challenges, such as the requirement to design specific methods for different editing types, retraining new models for various 3D scenes, and the absence of convenient human interaction during editing. To tackle these issues, we introduce a text-driven editing method, termed DN2N, which allows for the direct acquisition of a NeRF model with universal editing capabilities, eliminating the requirement for retraining. Our method employs off-the-shelf text-based editing models of 2D images to modify the 3D scene images, followed by a filtering process to discard poorly edited images that disrupt 3D consistency. We then consider the remaining inconsistency as a problem of removing noise perturbation, which can be solved by generating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques

MethodsDiffusion