Native and Compact Structured Latents for 3D Generation
Jianfeng Xiang, Xiaoxue Chen, Sicheng Xu, Ruicheng Wang, Zelong Lv, Yu Deng, Hongyuan Zhu, Yue Dong, Hao Zhao, Nicholas Jing Yuan, Jiaolong Yang

TL;DR
This paper introduces O-Voxel, a novel sparse voxel structure for 3D generative modeling that captures complex topologies and detailed surface attributes, enabling high-quality, efficient 3D asset generation.
Contribution
The paper presents two components: O-Voxel, a new omni-voxel representation, and a Sparse Compression VAE built on it, for modeling 3D assets with complex topologies and rich surface details.
Findings
O-Voxel effectively models arbitrary topologies.
Large-scale flow-matching models generate high-quality 3D assets.
Inference remains efficient despite model scale.
Abstract
Recent advancements in 3D generative modeling have significantly improved generation realism, yet the field is still hampered by existing representations, which struggle to capture assets with complex topologies and detailed appearance. This paper presents an approach for learning a structured latent representation from native 3D data to address this challenge. At its core is a new sparse voxel structure called O-Voxel, an omni-voxel representation that encodes both geometry and appearance. O-Voxel can robustly model arbitrary topology, including open, non-manifold, and fully-enclosed surfaces, while capturing comprehensive surface attributes beyond texture color, such as physically-based rendering parameters. Based on O-Voxel, we design a Sparse Compression VAE which provides a high spatial compression rate and a compact latent space. We train large-scale flow-matching models…
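To make the idea of a sparse voxel structure carrying both geometry and appearance more concrete, here is a minimal sketch of a generic sparse voxel grid. It is not the O-Voxel format described in the paper, whose actual layout is not given in this abstract. The function name `voxelize_points`, the unit-cube input convention, and the five-channel attribute layout (RGB, metallic, roughness) are all assumptions made for illustration.

```python
import numpy as np

def voxelize_points(points, attrs, resolution):
    """Quantize surface samples in [0, 1)^3 into a generic sparse voxel grid.

    Hypothetical helper for illustration only; not the paper's O-Voxel layout.
    Returns (coords, feats): the unique occupied integer voxel coordinates and
    the mean attribute vector of the samples that fall inside each voxel.
    """
    points = np.asarray(points, dtype=np.float64)
    attrs = np.asarray(attrs, dtype=np.float64)

    # Map each sample to the integer index of the voxel that contains it.
    idx = np.floor(points * resolution).astype(np.int64)
    idx = np.clip(idx, 0, resolution - 1)

    # Keep only occupied voxels; `inverse` maps each sample to its voxel row.
    coords, inverse = np.unique(idx, axis=0, return_inverse=True)
    inverse = inverse.reshape(-1)

    # Average the per-sample attributes within each occupied voxel.
    sums = np.zeros((len(coords), attrs.shape[1]))
    np.add.at(sums, inverse, attrs)
    counts = np.bincount(inverse, minlength=len(coords))
    feats = sums / counts[:, None]
    return coords, feats

# Three surface samples; assumed attribute layout: R, G, B, metallic, roughness.
sample_points = [[0.10, 0.10, 0.10], [0.12, 0.11, 0.10], [0.90, 0.90, 0.90]]
sample_attrs = [
    [1.0, 0.0, 0.0, 0.0, 0.5],
    [0.0, 0.0, 1.0, 1.0, 0.5],
    [0.0, 1.0, 0.0, 0.2, 0.8],
]
coords, feats = voxelize_points(sample_points, sample_attrs, resolution=4)
occupancy = len(coords) / 4**3  # fraction of the dense grid actually stored
```

Only occupied voxels are stored, so memory grows with surface area rather than with the cube of the resolution. That is the general motivation for sparse voxel structures; how O-Voxel encodes topology and material attributes is specific to the paper.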
Code & Models
- 🤗 microsoft/TRELLIS.2-4B (model) · ♡ 719
- 🤗 camenduru/TRELLIS.2-4B (model)
- 🤗 mancub/TRELLIS.2-4B (model) · ♡ 1
- 🤗 athena2634/TRELLIS.2-4B (model)
- 🤗 rogrocks123/TRELLIS.2-4B (model)
- 🤗 jacobperalesfx/TRELLIS.2-4B (model)
- 🤗 aerovfx/TRELLIS.2-4B (model)
- 🤗 hivenimbus/TRELLIS.2-4B (model)
- 🤗 KokkakNiphon/TRELLIS.2-4B (model)
- 🤗 mr4425390/TRELLIS.2-4B (model)
Taxonomy
Topics: 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
