Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng, Biwei Huang, Kun Zhang, Sara Magliacane

TL;DR
This paper introduces FANS-RL, a novel approach that models non-stationarity in reinforcement learning using causal structures and factored representations, improving adaptability and robustness in changing environments.
Contribution
It proposes a causal, factored adaptation framework for non-stationary RL that can recover underlying causal graphs and handle diverse non-stationary scenarios.
Findings
FANS-RL outperforms existing methods in return and robustness.
It achieves a compact latent state representation.
The approach effectively handles different types and frequencies of non-stationarity.
Abstract
Dealing with non-stationarity in environments (e.g., in the transition dynamics) and objectives (e.g., in the reward functions) is a challenging problem that is crucial in real-world applications of reinforcement learning (RL). While most current approaches model the changes as a single shared embedding vector, we leverage insights from the recent causality literature to model non-stationarity in terms of individual latent change factors, and causal graphs across different environments. In particular, we propose Factored Adaptation for Non-Stationary RL (FANS-RL), a factored adaption approach that learns jointly both the causal structure in terms of a factored MDP, and a factored representation of the individual time-varying change factors. We prove that under standard assumptions, we can completely recover the causal graph representing the factored transition and reward function, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMental Health Research Topics
