Estimator-Coupled Reinforcement Learning for Robust Purely Tactile In-Hand Manipulation
Lennart R\"ostel, Johannes Pitz, Leon Sievers, Berthold B\"auml

TL;DR
This paper introduces a coupled reinforcement learning and state estimation approach for purely tactile in-hand manipulation, achieving robust control and successful sim2real transfer on diverse objects with limited sensing.
Contribution
The authors propose an estimator-coupled reinforcement learning method that improves robustness and performance in tactile in-hand manipulation tasks, enabling effective sim2real transfer.
Findings
Achieved successful sim2real transfer for diverse object shapes.
Reoriented objects to all 24 orientations in SO(3) discretization.
Reoriented a cube to nine goals consecutively, surpassing previous methods.
Abstract
This paper identifies and addresses the problems with naively combining (reinforcement) learning-based controllers and state estimators for robotic in-hand manipulation. Specifically, we tackle the challenging task of purely tactile, goal-conditioned, dextrous in-hand reorientation with the hand pointing downwards. Due to the limited sensing available, many control strategies that are feasible in simulation when having full knowledge of the object's state do not allow for accurate state estimation. Hence, separately training the controller and the estimator and combining the two at test time leads to poor performance. We solve this problem by coupling the control policy to the state estimator already during training in simulation. This approach leads to more robust state estimation and overall higher performance on the task while maintaining an interpretability advantage over end-to-end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Muscle activation and electromyography studies · Reinforcement Learning in Robotics
MethodsSparse Evolutionary Training
