ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke, Eli VanderBilt, Alvaro Herrasti, Luca Weihs, Jordi, Salvador, Kiana Ehsani, Winson Han, Eric Kolve, Ali Farhadi, Aniruddha, Kembhavi, Roozbeh Mottaghi

TL;DR
ProcTHOR is a scalable platform for generating large, diverse virtual environments to train and evaluate embodied AI agents, achieving state-of-the-art results without explicit mapping or task supervision.
Contribution
We introduce ProcTHOR, a procedural environment generation framework enabling large-scale, customizable datasets for embodied AI, leading to improved performance on multiple benchmarks.
Findings
Models trained on ProcTHOR achieve state-of-the-art results.
ProcTHOR enables effective zero-shot transfer to downstream tasks.
Large, diverse datasets improve embodied AI training outcomes.
Abstract
Massive datasets and high-capacity models have driven many recent advancements in computer vision and natural language understanding. This work presents a platform to enable similar success stories in Embodied AI. We propose ProcTHOR, a framework for procedural generation of Embodied AI environments. ProcTHOR enables us to sample arbitrarily large datasets of diverse, interactive, customizable, and performant virtual environments to train and evaluate embodied agents across navigation, interaction, and manipulation tasks. We demonstrate the power and potential of ProcTHOR via a sample of 10,000 generated houses and a simple neural model. Models trained using only RGB images on ProcTHOR, with no explicit mapping and no human task supervision produce state-of-the-art results across 6 embodied AI benchmarks for navigation, rearrangement, and arm manipulation, including the presently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Robot Manipulation and Learning
