Towards Generalizable Reinforcement Learning via Causality-Guided   Self-Adaptive Representations

Yupei Yang; Biwei Huang; Fan Feng; Xinyue Wang; Shikui Tu; Lei Xu

arXiv:2407.20651·cs.LG·March 7, 2025

Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations

Yupei Yang, Biwei Huang, Fan Feng, Xinyue Wang, Shikui Tu, Lei Xu

PDF

Open Access

TL;DR

This paper introduces CSR, a causality-guided self-adaptive representation method that enables reinforcement learning agents to generalize across environments with changing dynamics and structures, improving adaptability with minimal samples.

Contribution

The paper presents a novel causality-based approach for RL that characterizes latent causal variables, allowing agents to adapt to diverse environment changes more effectively than existing methods.

Findings

01

CSR outperforms state-of-the-art baselines in various environments.

02

It adapts efficiently with only a few samples.

03

The method generalizes across different tasks with evolving dynamics.

Abstract

General intelligence requires quick adaption across tasks. While existing reinforcement learning (RL) methods have made progress in generalization, they typically assume only distribution changes between source and target domains. In this paper, we explore a wider range of scenarios where not only the distribution but also the environment spaces may change. For example, in the CoinRun environment, we train agents from easy levels and generalize them to difficulty levels where there could be new enemies that have never occurred before. To address this challenging setting, we introduce a causality-guided self-adaptive representation-based approach, called CSR, that equips the agent to generalize effectively across tasks with evolving dynamics. Specifically, we employ causal representation learning to characterize the latent causal variables within the RL system. Such compact causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics