Velocity-Space 3D Asset Editing

Hao Liu; Yuxuan Lin; Jingfeng Guo; Ruihang Chu; Junjie Wang; Ruotong Li; Yujiu Yang

arXiv:2605.07385·cs.GR·May 11, 2026

TL;DR

VS3D is a training-free framework for local 3D asset editing. It intervenes inside the ODE sampler of a rectified-flow generator to reduce identity leakage, amplify the edit, and preserve unedited content, all without external masks.

Contribution

The paper introduces an inside-sampler intervention with three modules that address identity leakage, edit amplification, and preservation in local 3D editing, without training or masks.

Findings

01 Addresses identity leakage with source anchoring

02 Enhances edit amplification through velocity guidance

03 Improves preservation with token-wise residual injection

Abstract

Editing a 3D asset locally, modifying a target region while preserving the rest, is a fundamental requirement of native 3D editing. Existing methods enforce locality through mechanisms external to the generator, such as manual 3D masks, post-hoc voxel merging, or 2D multi-view lifting. None of them intervene where the corruption actually originates: inside the ODE sampler. For a rectified-flow generator to achieve faithful local editing, its velocity field should be strong over the target editing region while vanishing on preserved content. Yet a single velocity field can hardly satisfy both requirements simultaneously, leading to three problems: (i) identity leakage that keeps the edit signal non-zero on preserved regions; (ii) no dedicated edit-amplification channel, so strengthening the edit inevitably perturbs identity; and (iii) an identity drag at the geometry and material stages,…
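
The locality requirement stated in the abstract (a velocity that is strong on the edit region and vanishes elsewhere) can be illustrated with a toy Euler integration of dx/dt = v(x, t). This is a minimal sketch under stated assumptions, not the VS3D method: the scalar "tokens", the explicit `edit_region` flags, the constant `leak` term, and all function names are hypothetical and chosen only to show how a non-zero velocity on preserved content accumulates into drift over the sampling steps.

```python
# Toy illustration only: scalar "tokens" stand in for 3D latent tokens.
# All names and values here are hypothetical, not taken from the paper.

def euler_sample(x0, velocity, num_steps=10):
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with Euler steps."""
    x = list(x0)
    dt = 1.0 / num_steps
    for k in range(num_steps):
        t = k * dt
        v = velocity(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x

source = [1.0, 2.0, 3.0, 4.0]             # toy source asset
target = [1.0, 2.0, 7.0, 8.0]             # only tokens 2 and 3 should change
edit_region = [False, False, True, True]  # used only to build the toy fields

def ideal_velocity(x, t):
    # Straight-line velocity on the edit region, exactly zero elsewhere.
    return [tg - s if m else 0.0
            for s, tg, m in zip(source, target, edit_region)]

def leaky_velocity(x, t, leak=0.3):
    # Same field, but with a small non-zero component on preserved tokens.
    return [tg - s if m else leak
            for s, tg, m in zip(source, target, edit_region)]

def preserved_drift(result):
    """Largest displacement of any token outside the edit region."""
    return max(abs(r - s)
               for r, s, m in zip(result, source, edit_region) if not m)

ideal = euler_sample(source, ideal_velocity)
leaky = euler_sample(source, leaky_velocity)
print("drift with vanishing velocity:", preserved_drift(ideal))
print("drift with leaky velocity:   ", preserved_drift(leaky))
```

In this toy setting both fields carry the edited tokens to their targets, but only the field that vanishes on preserved tokens leaves them untouched. The leaky field displaces them by roughly the leak magnitude integrated over the unit time interval.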
