Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings
Aditya Sanghi, Aliasghar Khani, Pradyumna Reddy, Arianna Rampini,, Derek Cheung, Kamal Rahimi Malekshan, Kanika Madan, Hooman Shayani

TL;DR
WaLa introduces a wavelet-based encoding for 3D shapes that enables billion-parameter generative models to produce high-quality, detailed 3D shapes efficiently, with significant compression and rapid inference.
Contribution
The paper presents a novel wavelet-based encoding method that compresses 3D shapes for large-scale generative models, enabling efficient training and fast inference at high resolutions.
Findings
2427x compression ratio with minimal detail loss
Generation of high-quality 3D shapes within seconds
State-of-the-art performance across multiple datasets
Abstract
Large-scale 3D generative models require substantial computational resources yet often fall short in capturing fine details and complex geometries at high resolutions. We attribute this limitation to the inefficiency of current representations, which lack the compactness required to model the generative models effectively. To address this, we introduce a novel approach called Wavelet Latent Diffusion, or WaLa, that encodes 3D shapes into wavelet-based, compact latent encodings. Specifically, we compress a signed distance field into a latent grid, achieving an impressive 2427x compression ratio with minimal loss of detail. This high level of compression allows our method to efficiently train large-scale generative networks without increasing the inference time. Our models, both conditional and unconditional, contain approximately one billion parameters and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction
MethodsDiffusion
