CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation
Zhengzhe Zhang, Wenjia Meng, Haoliang Sun, Gang Pan

TL;DR
CausalCOMRL introduces causal representation learning into offline meta-reinforcement learning to improve task generalization and reduce spurious correlations, leading to better policy performance across benchmarks.
Contribution
It is the first to integrate causal representations into context-based offline meta-RL, enhancing task generalization and robustness against confounders.
Findings
Outperforms existing methods on most meta-RL benchmarks.
Effectively uncovers causal relationships among task components.
Improves task representation distinction using mutual information and contrastive learning.
Abstract
Context-based offline meta-reinforcement learning (OMRL) methods have achieved appealing success by leveraging pre-collected offline datasets to develop task representations that guide policy learning. However, current context-based OMRL methods often introduce spurious correlations, where task components are incorrectly correlated due to confounders. These correlations can degrade policy performance when the confounders in the test task differ from those in the training task. To address this problem, we propose CausalCOMRL, a context-based OMRL method that integrates causal representation learning. This approach uncovers causal relationships among the task components and incorporates the causal relationships into task representations, enhancing the generalizability of RL agents. We further improve the distinction of task representations from different tasks by using mutual information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques
