PoCo: Policy Composition from and for Heterogeneous Robot Learning
Lirui Wang, Jialiang Zhao, Yilun Du, Edward H. Adelson, Russ Tedrake

TL;DR
PoCo introduces a flexible policy composition method that combines diverse heterogeneous data sources using diffusion models to learn generalized manipulation skills for robots, improving performance across tasks and domains.
Contribution
The paper presents a novel Policy Composition approach that effectively integrates multi-domain and multi-modality data for robotic manipulation, enabling robust generalization.
Findings
Outperforms single-source baselines in simulation and real-world tasks
Achieves robust manipulation across varying scenes and tasks
Successfully integrates heterogeneous data for policy learning
Abstract
Training general robotic policies from heterogeneous data for different tasks is a significant challenge. Existing robotic datasets vary in different modalities such as color, depth, tactile, and proprioceptive information, and collected in different domains such as simulation, real robots, and human videos. Current methods usually collect and pool all data from one domain to train a single policy to handle such heterogeneity in tasks and domains, which is prohibitively expensive and difficult. In this work, we present a flexible approach, dubbed Policy Composition, to combine information across such diverse modalities and domains for learning scene-level and task-level generalized manipulation skills, by composing different data distributions represented with diffusion models. Our method can use task-level composition for multi-task manipulation and be composed with analytic cost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Software Reliability and Analysis Research · Distributed systems and fault tolerance
MethodsDiffusion
