SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations

Runyi Yu; Yinhuai Wang; Qihan Zhao; Hok Wai Tsui; Jingbo Wang; Ping Tan; Qifeng Chen

arXiv:2505.02094·cs.LG·May 6, 2025

TL;DR

This paper introduces a framework for Reinforcement Learning from Interaction Demonstration (RLID) that uses data augmentation and adaptive sampling to learn robust, generalizable skills from sparse, noisy demonstrations.

Contribution

It proposes two data augmentation techniques, the Stitched Trajectory Graph (STG) and the State Transition Field (STF), together with Adaptive Trajectory Sampling (ATS) and a historical encoding mechanism, to improve learning from imperfect demonstration data.

Findings

01 Significant improvements in convergence stability.

02 Enhanced generalization to unseen skills.

03 Robustness in recovery from noisy demonstrations.

Abstract

We address a fundamental challenge in Reinforcement Learning from Interaction Demonstration (RLID): demonstration noise and coverage limitations. While existing data collection approaches provide valuable interaction demonstrations, they often yield sparse, disconnected, and noisy trajectories that fail to capture the full spectrum of possible skill variations and transitions. Our key insight is that despite noisy and sparse demonstrations, there exist infinite physically feasible trajectories that naturally bridge between demonstrated skills or emerge from their neighboring states, forming a continuous space of possible skill variations and transitions. Building upon this insight, we present two data augmentation techniques: a Stitched Trajectory Graph (STG) that discovers potential transitions between demonstration skills, and a State Transition Field (STF) that establishes unique…
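The abstract describes the STG only at a high level, so the following is a minimal sketch of the general idea of stitching, not the authors' construction. It assumes states are plain coordinate tuples and that a transition between two skills is plausible wherever their states lie within a Euclidean distance threshold. The function name `stitch_candidates`, the `threshold` parameter, and the toy `demos` data are all hypothetical.

```python
import math


def stitch_candidates(trajectories, threshold):
    """Propose cross-skill transition edges (illustrative assumption only).

    trajectories: dict mapping a skill name to a list of state tuples.
    Returns a list of (src_skill, src_index, dst_skill, dst_index, distance)
    for every pair of states from different skills within `threshold`.
    """
    edges = []
    for src_skill, src_traj in trajectories.items():
        for dst_skill, dst_traj in trajectories.items():
            if src_skill == dst_skill:
                continue  # only link states across different skills
            for i, src_state in enumerate(src_traj):
                for j, dst_state in enumerate(dst_traj):
                    dist = math.dist(src_state, dst_state)
                    if dist <= threshold:
                        edges.append((src_skill, i, dst_skill, j, dist))
    return edges


# Toy 2-D demonstrations; real interaction states would be far richer.
demos = {
    "dribble": [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)],
    "shoot": [(2.1, 0.1), (3.0, 1.0)],
}
edges = stitch_candidates(demos, threshold=0.5)
for edge in edges:
    print(edge)
```

In this toy case the last dribble state and the first shoot state are close enough to be linked in both directions. A physics-based pipeline would presumably still need to check that each proposed transition is physically feasible, which this sketch does not attempt.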

Code & Models

Repositories

Ingrid789/SkillMimic-V2
pytorch

Taxonomy

Topics: Topic Modeling · Machine Learning and Data Classification · Natural Language Processing Techniques