SketchDream: Sketch-based Text-to-3D Generation and Editing
Feng-Lin Liu, Hongbo Fu, Yu-Kun Lai, Lin Gao

TL;DR
SketchDream introduces a novel text-and-sketch driven 3D generation and editing framework that enhances control over geometry and appearance, leveraging multi-view diffusion models and a coarse-to-fine editing process for high-quality results.
Contribution
The paper presents a new method combining sketches and text for 3D generation, with multi-view diffusion models and a coarse-to-fine editing approach for improved control and quality.
Findings
Outperforms existing 2D ControlNet and image-to-3D methods in quality.
Enables detailed local editing with high fidelity.
Achieves consistent multi-view 3D generation.
Abstract
Existing text-based 3D generation methods generate attractive results but lack detailed geometry control. Sketches, known for their conciseness and expressiveness, have contributed to intuitive 3D modeling but are confined to producing texture-less mesh models within predefined categories. Integrating sketch and text simultaneously for 3D generation promises enhanced control over geometry and appearance but faces challenges from 2D-to-3D translation ambiguity and multi-modal condition integration. Moreover, further editing of 3D models in arbitrary views will give users more freedom to customize their models. However, it is difficult to achieve high generation quality, preserve unedited regions, and manage proper interactions between shape components. To solve the above issues, we propose a text-driven 3D content generation and editing method, SketchDream, which supports NeRF generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation
