Exploiting Policy Idling for Dexterous Manipulation
Annie S. Chen, Philemon Brakel, Antonia Bronars, Annie Xie, Sandy Huang, Oliver Groth, Maria Bauza, Markus Wulfmeier, Nicolas Heess, Dushyant Rao

TL;DR
This paper introduces Pause-Induced Perturbations (PIP), a novel method that detects and perturbs idling states in dexterous manipulation policies, significantly improving robustness and success rates without extra supervision.
Contribution
The paper proposes PIP, a new exploration strategy that leverages idling detection to enhance policy learning and robustness in dexterous manipulation tasks.
Findings
PIP improves test-time performance in simulated tasks.
Learning from episodes with idling leads to better policy iteration.
Real-world insertion success rate increases by 15-35%.
Abstract
Learning-based methods for dexterous manipulation have made notable progress in recent years. However, learned policies often still lack reliability and exhibit limited robustness to important factors of variation. One failure pattern that can be observed across many settings is that policies idle, i.e. they cease to move beyond a small region of states when they reach certain states. This policy idling is often a reflection of the training data. For instance, it can occur when the data contains small actions in areas where the robot needs to perform high-precision motions, e.g., when preparing to grasp an object or object insertion. Prior works have tried to mitigate this phenomenon e.g. by filtering the training data or modifying the control frequency. However, these approaches can negatively impact policy performance in other ways. As an alternative, we investigate how to leverage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Motor Control and Adaptation
