TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing
Teng Xu, Jiamin Chen, Peng Chen, Youjia Zhang, Junqing Yu, Wei Yang

TL;DR
TIGER introduces a novel bottom-up language aggregation and a multi-view diffusion-based score distillation method for coherent, open-vocabulary 3D Gaussian retrieval and editing, improving consistency and realism.
Contribution
It presents a systematic approach combining dense language embedding and multi-view diffusion for effective 3D Gaussian scene editing.
Findings
Achieves more consistent and realistic 3D scene edits.
Supports open-vocabulary retrieval in 3D Gaussian scenes.
Overcomes over-smoothing and inconsistency issues in editing.
Abstract
Editing objects within a scene is a critical functionality required across a broad spectrum of applications in computer vision and graphics. As 3D Gaussian Splatting (3DGS) emerges as a frontier in scene representation, the effective modification of 3D Gaussian scenes has become increasingly vital. This process entails accurately retrieve the target objects and subsequently performing modifications based on instructions. Though available in pieces, existing techniques mainly embed sparse semantics into Gaussians for retrieval, and rely on an iterative dataset update paradigm for editing, leading to over-smoothing or inconsistency issues. To this end, this paper proposes a systematic approach, namely TIGER, for coherent text-instructed 3D Gaussian retrieval and editing. In contrast to the top-down language grounding approach for 3D Gaussians, we adopt a bottom-up language aggregation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Analysis and Summarization · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
