Cross-Domain Policy Transfer by Representation Alignment via Multi-Domain Behavioral Cloning

Hayato Watahiki; Ryo Iwase; Ryosuke Unno; Yoshimasa Tsuruoka

arXiv:2407.16912·cs.LG·July 25, 2024

TL;DR

This paper introduces a simple yet effective method for cross-domain policy transfer that learns a shared latent space and an abstract policy using multi-domain behavioral cloning and MMD regularization, outperforming prior domain translation approaches.

Contribution

The paper combines multi-domain behavioral cloning with MMD regularization to improve cross-domain policy transfer, especially under significant domain gaps.

Findings

01. Higher transfer performance with MMD regularization.

02. Effective in cross-morphology and cross-viewpoint scenarios.

03. Single multi-domain policy simplifies extension.

Abstract

Transferring learned skills across diverse situations remains a fundamental challenge for autonomous agents, particularly when agents are not allowed to interact with an exact target setup. While prior approaches have predominantly focused on learning domain translation, they often struggle with handling significant domain gaps or out-of-distribution tasks. In this paper, we present a simple approach for cross-domain policy transfer that learns a shared latent representation across domains and a common abstract policy on top of it. Our approach leverages multi-domain behavioral cloning on unaligned trajectories of proxy tasks and employs maximum mean discrepancy (MMD) as a regularization term to encourage cross-domain alignment. The MMD regularization better preserves structures of latent state distributions than commonly used domain-discriminative distribution matching, leading to…
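The abstract describes adding an MMD term to a behavioral cloning objective so that latent states from different domains end up similarly distributed. The sketch below illustrates that idea only; it is not taken from the paper or its repository. It assumes a Gaussian (RBF) kernel with a fixed bandwidth, the biased sample estimator of squared MMD, and a simple weighted sum with the cloning loss. The names `rbf_kernel`, `mmd2`, `total_loss`, `bandwidth`, and `weight` are hypothetical.

```python
import numpy as np


def rbf_kernel(x, y, bandwidth=1.0):
    """Gaussian kernel matrix between rows of x (n, d) and rows of y (m, d)."""
    # Pairwise squared Euclidean distances via the expansion |a-b|^2 = |a|^2 + |b|^2 - 2ab.
    sq_dist = (
        np.sum(x ** 2, axis=1)[:, None]
        + np.sum(y ** 2, axis=1)[None, :]
        - 2.0 * x @ y.T
    )
    sq_dist = np.maximum(sq_dist, 0.0)  # guard against tiny negative values from rounding
    return np.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mmd2(x, y, bandwidth=1.0):
    """Biased sample estimate of squared MMD between two sets of latent states."""
    k_xx = rbf_kernel(x, x, bandwidth).mean()
    k_yy = rbf_kernel(y, y, bandwidth).mean()
    k_xy = rbf_kernel(x, y, bandwidth).mean()
    return k_xx + k_yy - 2.0 * k_xy


def total_loss(bc_loss, z_domain_a, z_domain_b, weight=1.0):
    """Assumed objective: behavioral cloning loss plus a weighted MMD penalty."""
    return bc_loss + weight * mmd2(z_domain_a, z_domain_b)
```

In this sketch, `z_domain_a` and `z_domain_b` stand for batches of encoded states from two domains. The penalty is close to zero when the two batches are similarly distributed and grows as they drift apart, which is the alignment pressure the abstract refers to.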

Code & Models

Repositories

hwatahiki/portable-latent-policy
Official
