Towards a Training Free Approach for 3D Scene Editing
Vivek Madhavaram, Shivangana Rawat, Chaitanya Devaguptapu, Charu, Sharma, Manohar Kaul

TL;DR
FreeEdit introduces a training-free, mesh-based approach for 3D scene editing driven by text prompts, enabling real-time, versatile modifications without scene-specific training.
Contribution
It presents a novel training-free method leveraging foundation models for 3D editing with insertion, replacement, and deletion operations using mesh representations.
Findings
Outperforms baseline models in accuracy and speed
Supports real-time, training-free 3D scene editing
Effective across diverse 3D scenes
Abstract
Text driven diffusion models have shown remarkable capabilities in editing images. However, when editing 3D scenes, existing works mostly rely on training a NeRF for 3D editing. Recent NeRF editing methods leverages edit operations by deploying 2D diffusion models and project these edits into 3D space. They require strong positional priors alongside text prompt to identify the edit location. These methods are operational on small 3D scenes and are more generalized to particular scene. They require training for each specific edit and cannot be exploited in real-time edits. To address these limitations, we propose a novel method, FreeEdit, to make edits in training free manner using mesh representations as a substitute for NeRF. Training-free methods are now a possibility because of the advances in foundation model's space. We leverage these models to bring a training-free alternative and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis
MethodsDiffusion
