Data Fusion-Enhanced Decision Transformer for Stable Cross-Domain Generalization
Guojian Wang, Quinson Hon, Xuyang Chen, Lin Zhao

TL;DR
This paper introduces DFDT, a novel data fusion approach that enhances decision transformers for better cross-domain generalization by aligning source and target data and improving token continuity.
Contribution
DFDT proposes a new data fusion pipeline with two-level filtering, MMD and OT measures, and advantage-conditioned tokens to improve cross-domain policy transfer.
Findings
DFDT outperforms existing methods in return and stability across various control tasks.
Theoretical bounds link policy performance to MMD and OT measures.
Sequence analysis confirms improved token continuity and semantics.
Abstract
Cross-domain shifts present a significant challenge for decision transformer (DT) policies. Existing cross-domain policy adaptation methods typically rely on a single simple filtering criterion to select source trajectory fragments and stitch them together. They match either state structure or action feasibility. However, the selected fragments still have poor stitchability: state structures can misalign, the return-to-go (RTG) becomes incomparable when the reward or horizon changes, and actions may jump at trajectory junctions. As a result, RTG tokens lose continuity, which compromises DT's inference ability. To tackle these challenges, we propose Data Fusion-Enhanced Decision Transformer (DFDT), a compact pipeline that restores stitchability. Particularly, DFDT fuses scarce target data with selectively trusted source fragments via a two-level data filter, maximum mean discrepancy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis
