Semi-Supervised Cross-Domain Imitation Learning

Li-Min Chu; Kai-Siang Ma; Ming-Hong Chen; Ping-Chun Hsieh

arXiv:2602.10793·cs.LG·February 12, 2026

Semi-Supervised Cross-Domain Imitation Learning

Li-Min Chu, Kai-Siang Ma, Ming-Hong Chen, Ping-Chun Hsieh

PDF

Open Access

TL;DR

This paper introduces a semi-supervised approach to cross-domain imitation learning that leverages limited expert demonstrations and unlabeled trajectories to improve policy transfer across domains.

Contribution

It proposes the first algorithm for semi-supervised cross-domain imitation learning with theoretical support and a novel loss function for domain discrepancy handling.

Findings

01

Achieves stable, data-efficient policy learning with minimal supervision.

02

Outperforms baseline methods on MuJoCo and Robosuite tasks.

03

Demonstrates effectiveness of the proposed cross-domain loss and adaptive weighting.

Abstract

Cross-domain imitation learning (CDIL) accelerates policy learning by transferring expert knowledge across domains, which is valuable in applications where the collection of expert data is costly. Existing methods are either supervised, relying on proxy tasks and explicit alignment, or unsupervised, aligning distributions without paired data, but often unstable. We introduce the Semi-Supervised CDIL (SS-CDIL) setting and propose the first algorithm for SS-CDIL with theoretical justification. Our method uses only offline data, including a small number of target expert demonstrations and some unlabeled imperfect trajectories. To handle domain discrepancy, we propose a novel cross-domain loss function for learning inter-domain state-action mappings and design an adaptive weight function to balance the source and target knowledge. Experiments on MuJoCo and Robosuite show consistent gains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research