PoCo: Policy Composition from and for Heterogeneous Robot Learning

Lirui Wang; Jialiang Zhao; Yilun Du; Edward H. Adelson; Russ Tedrake

arXiv:2402.02511·cs.RO·December 3, 2024·1 cites

PoCo: Policy Composition from and for Heterogeneous Robot Learning

Lirui Wang, Jialiang Zhao, Yilun Du, Edward H. Adelson, Russ Tedrake

PDF

Open Access

TL;DR

PoCo introduces a flexible policy composition method that combines diverse heterogeneous data sources using diffusion models to learn generalized manipulation skills for robots, improving performance across tasks and domains.

Contribution

The paper presents a novel Policy Composition approach that effectively integrates multi-domain and multi-modality data for robotic manipulation, enabling robust generalization.

Findings

01

Outperforms single-source baselines in simulation and real-world tasks

02

Achieves robust manipulation across varying scenes and tasks

03

Successfully integrates heterogeneous data for policy learning

Abstract

Training general robotic policies from heterogeneous data for different tasks is a significant challenge. Existing robotic datasets vary in different modalities such as color, depth, tactile, and proprioceptive information, and collected in different domains such as simulation, real robots, and human videos. Current methods usually collect and pool all data from one domain to train a single policy to handle such heterogeneity in tasks and domains, which is prohibitively expensive and difficult. In this work, we present a flexible approach, dubbed Policy Composition, to combine information across such diverse modalities and domains for learning scene-level and task-level generalized manipulation skills, by composing different data distributions represented with diffusion models. Our method can use task-level composition for multi-task manipulation and be composed with analytic cost…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Software Reliability and Analysis Research · Distributed systems and fault tolerance

MethodsDiffusion