Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling
Hayeon Kim, Ji Ha Jang, Se Young Chun

TL;DR
This paper introduces RoMaP, a novel framework for precise, robust, and flexible part-level editing of 3D Gaussian representations, addressing previous limitations in multi-view segmentation and loss ambiguity.
Contribution
RoMaP combines a view-dependent 3D mask generation with a regularized score distillation loss, enabling accurate and drastic local 3D edits with improved consistency and flexibility.
Findings
Achieves state-of-the-art local 3D editing results.
Provides robust and consistent part segmentation across viewpoints.
Enables flexible modifications beyond existing context.
Abstract
Recent advances in 3D neural representations and instance-level editing models have enabled the efficient creation of high-quality 3D content. However, achieving precise local 3D edits remains challenging, especially for Gaussian Splatting, due to inconsistent multi-view 2D part segmentations and inherently ambiguous nature of Score Distillation Sampling (SDS) loss. To address these limitations, we propose RoMaP, a novel local 3D Gaussian editing framework that enables precise and drastic part-level modifications. First, we introduce a robust 3D mask generation module with our 3D-Geometry Aware Label Prediction (3D-GALP), which uses spherical harmonics (SH) coefficients to model view-dependent label variations and soft-label property, yielding accurate and consistent part segmentations across viewpoints. Second, we propose a regularized SDS loss that combines the standard SDS loss with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Additive Manufacturing and 3D Printing Technologies · Interactive and Immersive Displays
MethodsAttentive Walk-Aggregating Graph Neural Network
