Rethinking State Disentanglement in Causal Reinforcement Learning
Haiyao Cao, Zhen Zhang, Panpan Cai, Yuhang Liu, Jinan Zou, Ehsan, Abbasnejad, Biwei Huang, Mingming Gong, Anton van den Hengel, Javen Qinfeng, Shi

TL;DR
This paper introduces a new approach to disentangle latent states from noise in reinforcement learning by leveraging causal insights specific to RL, reducing assumptions, and improving performance in POMDPs.
Contribution
It proposes a novel algorithm that simplifies structural constraints for state disentanglement in POMDPs, guided by RL-specific causal analysis.
Findings
Outperforms existing methods in benchmark control tasks
Effectively disentangles state belief from noise
Reduces assumptions in identifiability analysis
Abstract
One of the significant challenges in reinforcement learning (RL) when dealing with noise is estimating latent states from observations. Causality provides rigorous theoretical support for ensuring that the underlying states can be uniquely recovered through identifiability. Consequently, some existing work focuses on establishing identifiability from a causal perspective to aid in the design of algorithms. However, these results are often derived from a purely causal viewpoint, which may overlook the specific RL context. We revisit this research line and find that incorporating RL-specific context can reduce unnecessary assumptions in previous identifiability analyses for latent states. More importantly, removing these assumptions allows algorithm design to go beyond the earlier boundaries constrained by them. Leveraging these insights, we propose a novel approach for general partially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
