3D representation in 512-Byte:Variational tokenizer is the key for autoregressive 3D generation
Jinzhi Zhang, Feng Xiong, Mu Xu

TL;DR
This paper introduces the Variational Tokenizer (VAT), a novel method for efficient 3D data compression and autoregressive generation, enabling high-fidelity 3D shape synthesis with significant reduction in data size.
Contribution
The paper proposes VAT, a hierarchical variational tokenizer that transforms unordered 3D data into compact tokens suitable for autoregressive modeling, addressing the challenge of 3D data's lack of inherent order.
Findings
VAT achieves up to 250x compression of 3D meshes.
VAT outperforms existing methods in quality and efficiency.
VAT maintains high F-score at extreme compression levels.
Abstract
Autoregressive transformers have revolutionized high-fidelity image generation. One crucial ingredient lies in the tokenizer, which compresses high-resolution image patches into manageable discrete tokens with a scanning or hierarchical order suitable for large language models. Extending these tokenizers to 3D generation, however, presents a significant challenge: unlike image patches that naturally exhibit spatial sequence and multi-scale relationships, 3D data lacks an inherent order, making it difficult to compress into fewer tokens while preserving structural details. To address this, we introduce the Variational Tokenizer (VAT), which transforms unordered 3D data into compact latent tokens with an implicit hierarchy, suited for efficient and high-fidelity coarse-to-fine autoregressive modeling. VAT begins with an in-context transformer, which compress numerous unordered 3D features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Advanced Neural Network Applications · Advanced Electron Microscopy Techniques and Applications
MethodsSparse Evolutionary Training
