Iterative Residual Policy: for Goal-Conditioned Dynamic Manipulation of Deformable Objects
Cheng Chi, Benjamin Burchfiel, Eric Cousineau, Siyuan Feng, Shuran, Song

TL;DR
This paper introduces the Iterative Residual Policy (IRP), a learning framework for goal-conditioned dynamic manipulation of deformable objects that generalizes well from simulation to real-world scenarios.
Contribution
IRP learns delta dynamics instead of full system models, enabling efficient online optimization and robust generalization across different objects, tasks, and hardware.
Findings
IRP successfully manipulates deformable objects like ropes and cloth in simulation.
IRP generalizes from simulation to real-world with noisy dynamics.
IRP adapts to unseen objects and different robot hardware.
Abstract
This paper tackles the task of goal-conditioned dynamic manipulation of deformable objects. This task is highly challenging due to its complex dynamics (introduced by object deformation and high-speed action) and strict task requirements (defined by a precise goal specification). To address these challenges, we present Iterative Residual Policy (IRP), a general learning framework applicable to repeatable tasks with complex dynamics. IRP learns an implicit policy via delta dynamics -- instead of modeling the entire dynamical system and inferring actions from that model, IRP learns delta dynamics that predict the effects of delta action on the previously-observed trajectory. When combined with adaptive action sampling, the system can quickly optimize its actions online to reach a specified goal. We demonstrate the effectiveness of IRP on two tasks: whipping a rope to hit a target point…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition
