Progressive Text-to-3D Generation for Automatic 3D Prototyping
Han Yi, Zhedong Zheng, Xiangyu Xu, Tat-seng Chua

TL;DR
This paper introduces a progressive learning approach with a Multi-Scale Triplane Network for text-to-3D generation, effectively capturing fine details and improving 3D shape synthesis from natural language descriptions.
Contribution
It proposes a novel multi-scale triplane network and a progressive learning strategy to enhance detail recovery and optimization in text-to-3D generation tasks.
Findings
Outperforms existing methods on challenging descriptions
Successfully captures fine-grained 3D details
Produces viable 3D shapes from natural language descriptions
Abstract
Text-to-3D generation is to craft a 3D object according to a natural language description. This can significantly reduce the workload for manually designing 3D models and provide a more natural way of interaction for users. However, this problem remains challenging in recovering the fine-grained details effectively and optimizing a large-size 3D output efficiently. Inspired by the success of progressive learning, we propose a Multi-Scale Triplane Network (MTN) and a new progressive learning strategy. As the name implies, the Multi-Scale Triplane Network consists of four triplanes transitioning from low to high resolution. The low-resolution triplane could serve as an initial shape for the high-resolution ones, easing the optimization difficulty. To further enable the fine-grained details, we also introduce the progressive learning strategy, which explicitly demands the network to shift…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Motion and Animation · Image Processing and 3D Reconstruction
MethodsFocus
