Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning

Zhangjie Xia; Yu Yang; Pan Xu

arXiv:2602.21072·cs.LG·February 25, 2026

Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning

Zhangjie Xia, Yu Yang, Pan Xu

PDF

Open Access

TL;DR

This paper introduces LoDADA, a localized domain adaptation method for offline RL that clusters transitions to better handle local dynamics mismatch, improving policy learning across diverse environments.

Contribution

LoDADA is a novel clustering-based approach that estimates and filters source data based on localized dynamics discrepancies, enhancing data reuse in off-dynamics offline RL.

Findings

01

LoDADA outperforms existing methods in diverse environments.

02

It effectively leverages localized dynamics mismatch.

03

The method is scalable and computationally efficient.

Abstract

Off-dynamics offline reinforcement learning (RL) aims to learn a policy for a target domain using limited target data and abundant source data collected under different transition dynamics. Existing methods typically address dynamics mismatch either globally over the state space or via pointwise data filtering; these approaches can miss localized cross-domain similarities or incur high computational cost. We propose Localized Dynamics-Aware Domain Adaptation (LoDADA), which exploits localized dynamics mismatch to better reuse source data. LoDADA clusters transitions from source and target datasets and estimates cluster-level dynamics discrepancy via domain discrimination. Source transitions from clusters with small discrepancy are retained, while those from clusters with large discrepancy are filtered out. This yields a fine-grained and scalable data selection strategy that avoids…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Face recognition and analysis