Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass
Chen Liyi, Wang Pengfei, Zhang Guowen, Ma Zhiyuan, Zhang Lei

TL;DR
Omni-3DEdit is a unified, learning-based 3D editing model that efficiently performs various editing tasks in a single pass, overcoming the limitations of traditional iterative, task-specific methods.
Contribution
It introduces a generalized 3D editing framework that leverages synthesized multi-view data and a novel dual-stream LoRA module, enabling fast, one-pass editing without task-specific rules.
Findings
Reduces editing inference time from tens of minutes to about two minutes.
Demonstrates effectiveness across multiple 3D editing tasks.
Outperforms traditional iterative methods in efficiency and flexibility.
Abstract
Most instruction-driven 3D editing methods rely on 2D models to guide the explicit and iterative optimization of 3D representations. This paradigm, however, suffers from two primary drawbacks. First, it lacks a universal design of different 3D editing tasks because the explicit manipulation of 3D geometry necessitates task-dependent rules, e.g., 3D appearance editing demands inherent source 3D geometry, while 3D removal alters source geometry. Second, the iterative optimization process is highly time-consuming, often requiring thousands of invocations of 2D/3D updating. We present Omni-3DEdit, a unified, learning-based model that generalizes various 3D editing tasks implicitly. One key challenge to achieve our goal is the scarcity of paired source-edited multi-view assets for training. To address this issue, we construct a data pipeline, synthesizing a relatively rich number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Interactive and Immersive Displays · Generative Adversarial Networks and Image Synthesis
