Predicting Motion Plans for Articulating Everyday Objects
Arjun Gupta, Max E. Shepherd, Saurabh Gupta

TL;DR
This paper introduces a learning-based approach for predicting motion plans in mobile manipulation tasks involving articulated objects, leveraging a new simulator and representation to improve speed and accuracy in novel environments.
Contribution
The paper presents a novel simulator, SeqIK+$ heta_0$ representation, and learning models for fast, accurate motion plan prediction in articulated object manipulation.
Findings
Outperforms search-based methods in speed and accuracy
Enables real-time motion planning in novel environments
Demonstrates effectiveness on diverse articulated objects
Abstract
Mobile manipulation tasks such as opening a door, pulling open a drawer, or lifting a toilet lid require constrained motion of the end-effector under environmental and task constraints. This, coupled with partial information in novel environments, makes it challenging to employ classical motion planning approaches at test time. Our key insight is to cast it as a learning problem to leverage past experience of solving similar planning problems to directly predict motion plans for mobile manipulation tasks in novel situations at test time. To enable this, we develop a simulator, ArtObjSim, that simulates articulated objects placed in real scenes. We then introduce SeqIK+, a fast and flexible representation for motion plans. Finally, we learn models that use SeqIK+ to quickly predict motion plans for articulating novel objects at test time. Experimental evaluation shows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Human Motion and Animation
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
