Rethinking State Disentanglement in Causal Reinforcement Learning

Haiyao Cao; Zhen Zhang; Panpan Cai; Yuhang Liu; Jinan Zou; Ehsan; Abbasnejad; Biwei Huang; Mingming Gong; Anton van den Hengel; Javen Qinfeng; Shi

arXiv:2408.13498·cs.LG·August 27, 2024

Rethinking State Disentanglement in Causal Reinforcement Learning

Haiyao Cao, Zhen Zhang, Panpan Cai, Yuhang Liu, Jinan Zou, Ehsan, Abbasnejad, Biwei Huang, Mingming Gong, Anton van den Hengel, Javen Qinfeng, Shi

PDF

Open Access

TL;DR

This paper introduces a new approach to disentangle latent states from noise in reinforcement learning by leveraging causal insights specific to RL, reducing assumptions, and improving performance in POMDPs.

Contribution

It proposes a novel algorithm that simplifies structural constraints for state disentanglement in POMDPs, guided by RL-specific causal analysis.

Findings

01

Outperforms existing methods in benchmark control tasks

02

Effectively disentangles state belief from noise

03

Reduces assumptions in identifiability analysis

Abstract

One of the significant challenges in reinforcement learning (RL) when dealing with noise is estimating latent states from observations. Causality provides rigorous theoretical support for ensuring that the underlying states can be uniquely recovered through identifiability. Consequently, some existing work focuses on establishing identifiability from a causal perspective to aid in the design of algorithms. However, these results are often derived from a purely causal viewpoint, which may overlook the specific RL context. We revisit this research line and find that incorporating RL-specific context can reduce unnecessary assumptions in previous identifiability analyses for latent states. More importantly, removing these assumptions allows algorithm design to go beyond the earlier boundaries constrained by them. Leveraging these insights, we propose a novel approach for general partially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning