BEACON: Cross-Domain Co-Training of Generative Robot Policies via Best-Effort Adaptation

Antong Zhang; Han Qi; Heng Yang

arXiv:2605.08571·cs.RO·May 13, 2026

BEACON: Cross-Domain Co-Training of Generative Robot Policies via Best-Effort Adaptation

Antong Zhang, Han Qi, Heng Yang

PDF

TL;DR

BEACON is a theory-driven framework that enhances cross-domain robot policy training by discrepancy-aware co-training, improving robustness and data efficiency in various manipulation settings.

Contribution

It introduces a novel discrepancy-aware importance reweighting approach for co-training generative robot policies across domains.

Findings

01

BEACON outperforms baseline methods in sim-to-sim, sim-to-real, and multi-source settings.

02

It achieves implicit feature alignment without explicit alignment objectives.

03

The framework improves robustness and data efficiency in robot policy training.

Abstract

We introduce BEACON--Best-Effort Adaptation for Cross-Domain Co-Training--a theory-driven framework for training generative robot policies with abundant source demonstrations and limited target demonstrations. BEACON casts cross-domain co-training as a discrepancy-aware importance-reweighting problem, jointly learning a diffusion-based visuomotor policy and per-sample source weights that minimize an objective informed by target-domain generalization guarantees. To make best-effort adaptation practical for high-dimensional sequence policies, we develop scalable instance-level discrepancy estimators, stochastic alternating updates for policy and weights, and a multi-source extension that balances heterogeneous source domains. Across sim-to-sim, sim-to-real, and multi-source manipulation settings, BEACON improves robustness and data efficiency over target-only, fixed-ratio co-training, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.