Constrained Sampling to Guide Universal Manipulation RL
Marc Toussaint, Cornelius V. Braun, Eckart Cobo-Briesewitz, Sayantan Auddy, Armand Jordana, Justin Carpentier

TL;DR
This paper introduces Sample-Guided RL, a method that uses model-based constraint solvers to efficiently sample feasible states, guiding reinforcement learning to discover complex manipulation strategies in contact-rich environments.
Contribution
The paper presents a novel approach combining model-based sampling with RL to improve exploration and policy learning in complex manipulation tasks.
Findings
Achieves high success rates in a double sphere manipulation setting.
Demonstrates complex manipulation strategies in a panda arm environment.
Outperforms baseline methods in success rate in contact-rich tasks.
Abstract
We consider how model-based solvers can be leveraged to guide training of a universal policy to control from any feasible start state to any feasible goal in a contact-rich manipulation setting. While Reinforcement Learning (RL) has demonstrated its strength in such settings, it may struggle to sufficiently explore and discover complex manipulation strategies, especially in sparse-reward settings. Our approach is based on the idea of a lower-dimensional manifold of feasible, likely-visited states during such manipulation and to guide RL with a sampler from this manifold. We propose Sample-Guided RL, which uses model-based constraint solvers to efficiently sample feasible configurations (satisfying differentiable collision, contact, and force constraints) and leverage them to guide RL for universal (goal-conditioned) manipulation policies. We study using this data directly to bias state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Motor Control and Adaptation
