Objaverse-XL: A Universe of 10M+ 3D Objects

Matt Deitke; Ruoshi Liu; Matthew Wallingford; Huong Ngo; Oscar Michel,; Aditya Kusupati; Alan Fan; Christian Laforte; Vikram Voleti; Samir Yitzhak; Gadre; Eli VanderBilt; Aniruddha Kembhavi; Carl Vondrick; Georgia Gkioxari,; Kiana Ehsani; Ludwig Schmidt; Ali Farhadi

arXiv:2307.05663·cs.CV·July 13, 2023·36 cites

Objaverse-XL: A Universe of 10M+ 3D Objects

Matt Deitke, Ruoshi Liu, Matthew Wallingford, Huong Ngo, Oscar Michel,, Aditya Kusupati, Alan Fan, Christian Laforte, Vikram Voleti, Samir Yitzhak, Gadre, Eli VanderBilt, Aniruddha Kembhavi, Carl Vondrick, Georgia Gkioxari,, Kiana Ehsani, Ludwig Schmidt, Ali Farhadi

PDF

Open Access 2 Repos 2 Datasets

TL;DR

Objaverse-XL is a large-scale, diverse 3D object dataset with over 10 million items, designed to advance 3D vision tasks by enabling training on extensive data, leading to improved zero-shot generalization.

Contribution

This work introduces Objaverse-XL, the largest and most diverse 3D object dataset to date, facilitating significant progress in 3D vision through large-scale training.

Findings

01

Training Zero123 on Objaverse-XL improves zero-shot generalization.

02

Utilizing over 100 million multi-view images enhances 3D model performance.

03

The dataset enables new possibilities for 3D vision research.

Abstract

Natural language processing and 2D vision models have attained remarkable proficiency on many tasks primarily by escalating the scale of training data. However, 3D vision tasks have not seen the same progress, in part due to the challenges of acquiring high-quality 3D data. In this work, we present Objaverse-XL, a dataset of over 10 million 3D objects. Our dataset comprises deduplicated 3D objects from a diverse set of sources, including manually designed objects, photogrammetry scans of landmarks and everyday items, and professional scans of historic and antique artifacts. Representing the largest scale and diversity in the realm of 3D datasets, Objaverse-XL enables significant new possibilities for 3D vision. Our experiments demonstrate the improvements enabled with the scale provided by Objaverse-XL. We show that by training Zero123 on novel view synthesis, utilizing over 100 million…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage