ManipulaTHOR: A Framework for Visual Object Manipulation

Kiana Ehsani; Winson Han; Alvaro Herrasti; Eli VanderBilt; Luca Weihs,; Eric Kolve; Aniruddha Kembhavi; Roozbeh Mottaghi

arXiv:2104.11213·cs.CV·April 23, 2021

ManipulaTHOR: A Framework for Visual Object Manipulation

Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs,, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi

PDF

1 Repo

TL;DR

ManipulaTHOR introduces a new framework and challenge for embodied AI focused on complex object manipulation in visually rich, dynamic environments, emphasizing long-horizon planning and generalization.

Contribution

The paper presents a physics-enabled, visually rich framework for object manipulation and introduces the ArmPointNav challenge to advance research in complex, real-world manipulation tasks.

Findings

01

Existing methods show promise but need significant improvement.

02

The ArmPointNav challenge highlights key difficulties like obstacle avoidance and occlusion.

03

Framework enables testing manipulation in complex, unseen environments.

Abstract

The domain of Embodied AI has recently witnessed substantial progress, particularly in navigating agents within their environments. These early successes have laid the building blocks for the community to tackle tasks that require agents to actively interact with objects in their environment. Object manipulation is an established research domain within the robotics community and poses several challenges including manipulator motion, grasping and long-horizon planning, particularly when dealing with oft-overlooked practical setups involving visually rich and complex scenes, manipulation using mobile agents (as opposed to tabletop manipulation), and generalization to unseen environments and objects. We propose a framework for object manipulation built upon the physics-enabled, visually rich AI2-THOR framework and present a new challenge to the Embodied AI community known as ArmPointNav.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/manipulathor
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.