Learning Parameterized Skills from Demonstrations

Vedant Gupta; Haotian Fu; Calvin Luo; Yiding Jiang; George Konidaris

arXiv:2510.24095·cs.LG·October 29, 2025

Learning Parameterized Skills from Demonstrations

Vedant Gupta, Haotian Fu, Calvin Luo, Yiding Jiang, George Konidaris

PDF

TL;DR

This paper introduces DEPS, an end-to-end algorithm that learns interpretable, parameterized skills from expert demonstrations, improving generalization to new tasks and outperforming existing baselines on benchmark datasets.

Contribution

DEPS jointly learns parameterized skill policies and a meta-policy using temporal variational inference, addressing latent variable degeneracy and enhancing skill interpretability.

Findings

01

Outperforms baseline methods on LIBERO and MetaWorld benchmarks.

02

Learns semantically meaningful, temporally extended skills.

03

Discovers interpretable skills with continuous parameters, like grasp locations.

Abstract

We present DEPS, an end-to-end algorithm for discovering parameterized skills from expert demonstrations. Our method learns parameterized skill policies jointly with a meta-policy that selects the appropriate discrete skill and continuous parameters at each timestep. Using a combination of temporal variational inference and information-theoretic regularization methods, we address the challenge of degeneracy common in latent variable models, ensuring that the learned skills are temporally extended, semantically meaningful, and adaptable. We empirically show that learning parameterized skills from multitask expert demonstrations significantly improves generalization to unseen tasks. Our method outperforms multitask as well as skill learning baselines on both LIBERO and MetaWorld benchmarks. We also demonstrate that DEPS discovers interpretable parameterized skills, such as an object…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.