Out-of-Dynamics Imitation Learning from Multimodal Demonstrations
Yiwen Qiu, Jialong Wu, Zhangjie Cao, Mingsheng Long

TL;DR
This paper introduces out-of-dynamics imitation learning (OOD-IL), enabling imitation from demonstrations with different dynamics by clustering demonstrations and measuring their transferability, improving performance across various environments.
Contribution
The paper proposes a novel sequence-based contrastive clustering and adversarial transferability measurement for out-of-dynamics imitation learning, addressing multimodal demonstration distributions.
Findings
More accurate identification of transferable demonstrations
Outperforms prior methods in imitation accuracy
Effective across diverse environments
Abstract
Existing imitation learning works mainly assume that the demonstrator who collects demonstrations shares the same dynamics as the imitator. However, the assumption limits the usage of imitation learning, especially when collecting demonstrations for the imitator is difficult. In this paper, we study out-of-dynamics imitation learning (OOD-IL), which relaxes the assumption to that the demonstrator and the imitator have the same state spaces but could have different action spaces and dynamics. OOD-IL enables imitation learning to utilize demonstrations from a wide range of demonstrators but introduces a new challenge: some demonstrations cannot be achieved by the imitator due to the different dynamics. Prior works try to filter out such demonstrations by feasibility measurements, but ignore the fact that the demonstrations exhibit a multimodal distribution since the different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition
