Out-of-Dynamics Imitation Learning from Multimodal Demonstrations

Yiwen Qiu; Jialong Wu; Zhangjie Cao; Mingsheng Long

arXiv:2211.06839·cs.RO·November 15, 2022·1 cites

Out-of-Dynamics Imitation Learning from Multimodal Demonstrations

Yiwen Qiu, Jialong Wu, Zhangjie Cao, Mingsheng Long

PDF

Open Access 1 Repo

TL;DR

This paper introduces out-of-dynamics imitation learning (OOD-IL), enabling imitation from demonstrations with different dynamics by clustering demonstrations and measuring their transferability, improving performance across various environments.

Contribution

The paper proposes a novel sequence-based contrastive clustering and adversarial transferability measurement for out-of-dynamics imitation learning, addressing multimodal demonstration distributions.

Findings

01

More accurate identification of transferable demonstrations

02

Outperforms prior methods in imitation accuracy

03

Effective across diverse environments

Abstract

Existing imitation learning works mainly assume that the demonstrator who collects demonstrations shares the same dynamics as the imitator. However, the assumption limits the usage of imitation learning, especially when collecting demonstrations for the imitator is difficult. In this paper, we study out-of-dynamics imitation learning (OOD-IL), which relaxes the assumption to that the demonstrator and the imitator have the same state spaces but could have different action spaces and dynamics. OOD-IL enables imitation learning to utilize demonstrations from a wide range of demonstrators but introduces a new challenge: some demonstrations cannot be achieved by the imitator due to the different dynamics. Prior works try to filter out such demonstrations by feasibility measurements, but ignore the fact that the demonstrations exhibit a multimodal distribution since the different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

evieq01/oodil
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition