3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes
Tejaswini Medi, Arianna Rampini, Pradyumna Reddy, Pradeep Kumar Jayaraman, Margret Keuper

TL;DR
3D-WAG introduces a hierarchical autoregressive model using wavelet tokens and Transformers for efficient, high-fidelity 3D shape generation, outperforming existing methods in quality and computational cost.
Contribution
The paper presents a novel wavelet-based hierarchical autoregressive approach for 3D shape generation, reducing computational costs while maintaining detailed geometric fidelity.
Findings
Achieves superior Coverage and MMD metrics compared to state-of-the-art methods.
Enables unconditional, class-conditioned, and text-conditioned 3D shape generation.
Reduces computational cost by predicting next-scale tokens instead of next-token in 3D AR models.
Abstract
Autoregressive (AR) models have achieved remarkable success in natural language and image generation, but their application to 3D shape modeling remains largely unexplored. Unlike diffusion models, AR models enable more efficient and controllable generation with faster inference times, making them especially suitable for data-intensive domains. Traditional 3D generative models using AR approaches often rely on ``next-token" predictions at the voxel or point level. While effective for certain applications, these methods can be restrictive and computationally expensive when dealing with large-scale 3D data. To tackle these challenges, we introduce 3D-WAG, an AR model for 3D implicit distance fields that can perform unconditional shape generation, class-conditioned and also text-conditioned shape generation. Our key idea is to encode shapes as multi-scale wavelet token maps and use a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical measurement and interference techniques · Optical Systems and Laser Technology · Advanced Optical Imaging Technologies
MethodsAttention Is All You Need · Residual Connection · Softmax · Adam · Label Smoothing · Dropout · Dense Connections · Linear Layer · Layer Normalization · Byte Pair Encoding
