From Simple to Complex Skills: The Case of In-Hand Object Reorientation
Haozhi Qi, Brent Yi, Mike Lambeta, Yi Ma, Roberto Calandra, Jitendra, Malik

TL;DR
This paper introduces a hierarchical policy system that leverages low-level skills and a pose estimator to improve in-hand object reorientation, enhancing robustness and sim-to-real transfer for complex manipulation tasks.
Contribution
The work presents a hierarchical policy framework and a generalizable pose estimator that together enable more robust and transferable in-hand object reorientation skills.
Findings
Hierarchical policy improves robustness to out-of-distribution changes.
System successfully reorients symmetrical and textureless objects.
Enhanced sim-to-real transfer performance.
Abstract
Learning policies in simulation and transferring them to the real world has become a promising approach in dexterous manipulation. However, bridging the sim-to-real gap for each new task requires substantial human effort, such as careful reward engineering, hyperparameter tuning, and system identification. In this work, we present a system that leverages low-level skills to address these challenges for more complex tasks. Specifically, we introduce a hierarchical policy for in-hand object reorientation based on previously acquired rotation skills. This hierarchical policy learns to select which low-level skill to execute based on feedback from both the environment and the low-level skill policies themselves. Compared to learning from scratch, the hierarchical policy is more robust to out-of-distribution changes and transfers easily from simulation to real-world environments.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVirtual Reality Applications and Impacts · Spatial Cognition and Navigation · Visual and Cognitive Learning Processes
