Policy Stitching: Learning Transferable Robot Policies

Pingcheng Jian; Easop Lee; Zachary Bell; Michael M. Zavlanos; Boyuan; Chen

arXiv:2309.13753·cs.RO·September 26, 2023

Policy Stitching: Learning Transferable Robot Policies

Pingcheng Jian, Easop Lee, Zachary Bell, Michael M. Zavlanos, Boyuan, Chen

PDF

Open Access

TL;DR

Policy Stitching introduces a modular transfer learning framework for robots, enabling rapid adaptation to new tasks and configurations by stitching separately trained modules, demonstrated through superior zero-shot and few-shot transfer results.

Contribution

It proposes a novel modular policy framework with latent space alignment, allowing direct stitching of trained modules for efficient transfer learning in robotics.

Findings

01

Superior zero-shot transfer performance in simulated and real-world tasks

02

Effective few-shot adaptation with minimal additional training

03

Outperforms existing transfer learning methods in complex manipulation tasks

Abstract

Training robots with reinforcement learning (RL) typically involves heavy interactions with the environment, and the acquired skills are often sensitive to changes in task environments and robot kinematics. Transfer RL aims to leverage previous knowledge to accelerate learning of new tasks or new body configurations. However, existing methods struggle to generalize to novel robot-task combinations and scale to realistic tasks due to complex architecture design or strong regularization that limits the capacity of the learned policy. We propose Policy Stitching, a novel framework that facilitates robot transfer learning for novel combinations of robots and tasks. Our key idea is to apply modular policy design and align the latent representations between the modular interfaces. Our method allows direct stitching of the robot and task modules trained separately to form a new policy for fast…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Domain Adaptation and Few-Shot Learning