LaGeM: A Large Geometry Model for 3D Representation Learning and   Diffusion

Biao Zhang; Peter Wonka

arXiv:2410.01295·cs.CV·October 3, 2024

LaGeM: A Large Geometry Model for 3D Representation Learning and Diffusion

Biao Zhang, Peter Wonka

PDF

Open Access

TL;DR

This paper presents LaGeM, a hierarchical autoencoder for 3D models that efficiently compresses and represents complex geometries, enabling improved generative modeling with diffusion techniques on unordered vector sets.

Contribution

Introduces a hierarchical autoencoder for 3D models operating on unordered vectors, enabling efficient large-scale representation and diffusion-based generative modeling.

Findings

01

Model achieves high-resolution geometry representation.

02

Training requires less time and memory than baseline.

03

Effective for generative modeling with cascaded diffusion.

Abstract

This paper introduces a novel hierarchical autoencoder that maps 3D models into a highly compressed latent space. The hierarchical autoencoder is specifically designed to tackle the challenges arising from large-scale datasets and generative modeling using diffusion. Different from previous approaches that only work on a regular image or volume grid, our hierarchical autoencoder operates on unordered sets of vectors. Each level of the autoencoder controls different geometric levels of detail. We show that the model can be used to represent a wide range of 3D models while faithfully representing high-resolution geometry details. The training of the new architecture takes 0.70x time and 0.58x memory compared to the baseline. We also explore how the new representation can be used for generative modeling. Specifically, we propose a cascaded diffusion framework where each stage is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Medical Imaging and Analysis · Image Processing and 3D Reconstruction

MethodsDiffusion