Constrained Sampling to Guide Universal Manipulation RL

Marc Toussaint; Cornelius V. Braun; Eckart Cobo-Briesewitz; Sayantan Auddy; Armand Jordana; Justin Carpentier

arXiv:2602.08557·cs.RO·February 10, 2026

Constrained Sampling to Guide Universal Manipulation RL

Marc Toussaint, Cornelius V. Braun, Eckart Cobo-Briesewitz, Sayantan Auddy, Armand Jordana, Justin Carpentier

PDF

Open Access

TL;DR

This paper introduces Sample-Guided RL, a method that uses model-based constraint solvers to efficiently sample feasible states, guiding reinforcement learning to discover complex manipulation strategies in contact-rich environments.

Contribution

The paper presents a novel approach combining model-based sampling with RL to improve exploration and policy learning in complex manipulation tasks.

Findings

01

Achieves high success rates in a double sphere manipulation setting.

02

Demonstrates complex manipulation strategies in a panda arm environment.

03

Outperforms baseline methods in success rate in contact-rich tasks.

Abstract

We consider how model-based solvers can be leveraged to guide training of a universal policy to control from any feasible start state to any feasible goal in a contact-rich manipulation setting. While Reinforcement Learning (RL) has demonstrated its strength in such settings, it may struggle to sufficiently explore and discover complex manipulation strategies, especially in sparse-reward settings. Our approach is based on the idea of a lower-dimensional manifold of feasible, likely-visited states during such manipulation and to guide RL with a sampler from this manifold. We propose Sample-Guided RL, which uses model-based constraint solvers to efficiently sample feasible configurations (satisfying differentiable collision, contact, and force constraints) and leverage them to guide RL for universal (goal-conditioned) manipulation policies. We study using this data directly to bias state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Motor Control and Adaptation