Diffusion Meets Options: Hierarchical Generative Skill Composition for   Temporally-Extended Tasks

Zeyu Feng; Hao Luan; Kevin Yuchen Ma; Harold Soh

arXiv:2410.02389·cs.RO·October 4, 2024

Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks

Zeyu Feng, Hao Luan, Kevin Yuchen Ma, Harold Soh

PDF

Open Access

TL;DR

This paper introduces DOPPLER, a hierarchical framework combining diffusion models and reinforcement learning to generate and update long-horizon, temporally-extended plans for robots based on linear temporal logic instructions.

Contribution

It presents a novel hierarchical approach that integrates diffusion models with reinforcement learning for efficient, instruction-driven trajectory planning in robotics.

Findings

01

DOPPLER effectively generates trajectories satisfying complex temporal objectives.

02

The method improves speed and diversity of options through determinantal-guided sampling.

03

Experiments show successful navigation and manipulation tasks with obstacle avoidance and visitation sequences.

Abstract

Safe and successful deployment of robots requires not only the ability to generate complex plans but also the capacity to frequently replan and correct execution errors. This paper addresses the challenge of long-horizon trajectory planning under temporally extended objectives in a receding horizon manner. To this end, we propose DOPPLER, a data-driven hierarchical framework that generates and updates plans based on instruction specified by linear temporal logic (LTL). Our method decomposes temporal tasks into chain of options with hierarchical reinforcement learning from offline non-expert datasets. It leverages diffusion models to generate options with low-level actions. We devise a determinantal-guided posterior sampling technique during batch generation, which improves the speed and diversity of diffusion generated options, leading to more efficient querying. Experiments on robot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCreativity in Education and Neuroscience

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion