PolyGen: An Autoregressive Generative Model of 3D Meshes
Charlie Nash, Yaroslav Ganin, S. M. Ali Eslami, Peter W. Battaglia

TL;DR
PolyGen introduces a Transformer-based autoregressive model that directly generates 3D polygon meshes, enabling high-quality, probabilistic mesh synthesis conditioned on various inputs, advancing 3D shape modeling.
Contribution
It is the first to model 3D meshes directly with a Transformer, predicting vertices and faces sequentially, and can condition on multiple input modalities.
Findings
Produces high-quality, usable meshes
Establishes log-likelihood benchmarks for mesh modeling
Demonstrates competitive surface reconstruction performance
Abstract
Polygon meshes are an efficient representation of 3D geometry, and are of central importance in computer graphics, robotics and games development. Existing learning-based approaches have avoided the challenges of working with 3D meshes, instead using alternative object representations that are more compatible with neural architectures and training approaches. We present an approach which models the mesh directly, predicting mesh vertices and faces sequentially using a Transformer-based architecture. Our model can condition on a range of inputs, including object classes, voxels, and images, and because the model is probabilistic it can produce samples that capture uncertainty in ambiguous scenarios. We show that the model is capable of producing high-quality, usable meshes, and establish log-likelihood benchmarks for the mesh-modelling task. We also evaluate the conditional models on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Image Processing and 3D Reconstruction
