SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model

Haowen Zheng; Yanyan Liang

arXiv:2411.12290·cs.CV·November 20, 2024

SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model

Haowen Zheng, Yanyan Liang

PDF

Open Access 1 Repo

TL;DR

SSEditor is a novel 3D scene generation framework that allows controllable, target-specific scene editing without multiple resampling steps, improving flexibility and scene quality.

Contribution

It introduces a two-stage diffusion-based framework with a geometric-semantic fusion module for controllable 3D scene editing and generation.

Findings

01

Outperforms previous methods in controllability and quality

02

Capable of generating novel urban scenes on unseen datasets

03

Enables rapid 3D scene construction

Abstract

Recent advancements in 3D diffusion-based semantic scene generation have gained attention. However, existing methods rely on unconditional generation and require multiple resampling steps when editing scenes, which significantly limits their controllability and flexibility. To this end, we propose SSEditor, a controllable Semantic Scene Editor that can generate specified target categories without multiple-step resampling. SSEditor employs a two-stage diffusion-based framework: (1) a 3D scene autoencoder is trained to obtain latent triplane features, and (2) a mask-conditional diffusion model is trained for customizable 3D semantic scene generation. In the second stage, we introduce a geometric-semantic fusion module that enhance the model's ability to learn geometric and semantic information. This ensures that objects are generated with correct positions, sizes, and categories.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SSEditor/SSEditor
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Cell Image Analysis Techniques

MethodsDiffusion