Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for   Robot Learning

Yixiao Wang; Yifei Zhang; Mingxiao Huo; Ran Tian; Xiang Zhang; Yichen; Xie; Chenfeng Xu; Pengliang Ji; Wei Zhan; Mingyu Ding; Masayoshi Tomizuka

arXiv:2407.01531·cs.RO·October 28, 2024·1 cites

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning

Yixiao Wang, Yifei Zhang, Mingxiao Huo, Ran Tian, Xiang Zhang, Yichen, Xie, Chenfeng Xu, Pengliang Ji, Wei Zhan, Mingyu Ding, Masayoshi Tomizuka

PDF

Open Access

TL;DR

The paper introduces Sparse Diffusion Policy (SDP), a novel, efficient, and flexible approach for robot learning that enables multitask and continual learning by selectively activating experts within a transformer-based diffusion framework.

Contribution

It proposes a sparse, reusable policy using Mixture of Experts in a diffusion model, allowing efficient multitask and continual learning without retraining the entire system.

Findings

01

SDP performs well in multitask scenarios with minimal active parameters.

02

SDP prevents catastrophic forgetting in continual learning.

03

SDP enables efficient transfer of skills across tasks.

Abstract

The increasing complexity of tasks in robotics demands efficient strategies for multitask and continual learning. Traditional models typically rely on a universal policy for all tasks, facing challenges such as high computational costs and catastrophic forgetting when learning new tasks. To address these issues, we introduce a sparse, reusable, and flexible policy, Sparse Diffusion Policy (SDP). By adopting Mixture of Experts (MoE) within a transformer-based diffusion policy, SDP selectively activates experts and skills, enabling efficient and task-specific learning without retraining the entire model. SDP not only reduces the burden of active parameters but also facilitates the seamless integration and reuse of experts across various tasks. Extensive experiments on diverse tasks in both simulations and real world show that SDP 1) excels in multitask scenarios with negligible increases…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsDiffusion