AI2-THOR: An Interactive 3D Environment for Visual AI
Eric Kolve, Roozbeh Mottaghi, Winson Han, Eli VanderBilt, Luca Weihs,, Alvaro Herrasti, Matt Deitke, Kiana Ehsani, Daniel Gordon, Yuke Zhu,, Aniruddha Kembhavi, Abhinav Gupta, Ali Farhadi

TL;DR
AI2-THOR is a highly realistic 3D environment for visual AI research, enabling agents to navigate and interact with objects to advance various AI tasks and learning paradigms.
Contribution
It introduces a versatile, photo-realistic 3D platform for visual AI research, supporting diverse tasks and fostering progress in building visually intelligent systems.
Findings
Supports multiple AI research domains including reinforcement learning and object detection
Provides near photo-realistic indoor scenes for realistic interaction
Facilitates development of visually intelligent models
Abstract
We introduce The House Of inteRactions (THOR), a framework for visual AI research, available at http://ai2thor.allenai.org. AI2-THOR consists of near photo-realistic 3D indoor scenes, where AI agents can navigate in the scenes and interact with objects to perform tasks. AI2-THOR enables research in many different domains including but not limited to deep reinforcement learning, imitation learning, learning by interaction, planning, visual question answering, unsupervised representation learning, object detection and segmentation, and learning models of cognition. The goal of AI2-THOR is to facilitate building visually intelligent models and push the research forward in this domain.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
