Progressive Text-to-3D Generation for Automatic 3D Prototyping

Han Yi; Zhedong Zheng; Xiangyu Xu; Tat-seng Chua

arXiv:2309.14600·cs.CV·September 27, 2023

Progressive Text-to-3D Generation for Automatic 3D Prototyping

Han Yi, Zhedong Zheng, Xiangyu Xu, Tat-seng Chua

PDF

Open Access 1 Repo

TL;DR

This paper introduces a progressive learning approach with a Multi-Scale Triplane Network for text-to-3D generation, effectively capturing fine details and improving 3D shape synthesis from natural language descriptions.

Contribution

It proposes a novel multi-scale triplane network and a progressive learning strategy to enhance detail recovery and optimization in text-to-3D generation tasks.

Findings

01

Outperforms existing methods on challenging descriptions

02

Successfully captures fine-grained 3D details

03

Produces viable 3D shapes from natural language descriptions

Abstract

Text-to-3D generation is to craft a 3D object according to a natural language description. This can significantly reduce the workload for manually designing 3D models and provide a more natural way of interaction for users. However, this problem remains challenging in recovering the fine-grained details effectively and optimizing a large-size 3D output efficiently. Inspired by the success of progressive learning, we propose a Multi-Scale Triplane Network (MTN) and a new progressive learning strategy. As the name implies, the Multi-Scale Triplane Network consists of four triplanes transitioning from low to high resolution. The low-resolution triplane could serve as an initial shape for the high-resolution ones, easing the optimization difficulty. To further enable the fine-grained details, we also introduce the progressive learning strategy, which explicitly demands the network to shift…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Texaser/MTN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Human Motion and Animation · Image Processing and 3D Reconstruction

MethodsFocus