Stable Deep Reinforcement Learning via Isotropic Gaussian Representations
Ali Saheb Pasand, Johan Obando-Ceron, Aaron Courville, Pouya Bashivan, Pablo Samuel Castro

TL;DR
This paper introduces isotropic Gaussian embeddings and a regularization method to enhance stability and adaptability in deep reinforcement learning under non-stationary conditions.
Contribution
It proposes Sketched Isotropic Gaussian Regularization to shape representations, improving stability and performance in non-stationary reinforcement learning environments.
Findings
Improved stability and performance in various domains.
Reduced representation collapse and neuron dormancy.
Enhanced adaptability of agents.
Abstract
Deep reinforcement learning systems often suffer from unstable training dynamics due to non-stationarity, where learning objectives and data distributions evolve over time. We show that under non-stationary targets, isotropic Gaussian embeddings are provably advantageous. In particular, they induce stable tracking of time-varying targets for linear readouts, achieve maximal entropy under a fixed variance budget, and encourage a balanced use of all representational dimensions--all of which enable agents to be more adaptive and stable. Building on this insight, we propose the use of Sketched Isotropic Gaussian Regularization for shaping representations toward an isotropic Gaussian distribution during training. We demonstrate empirically, over a variety of domains, that this simple and computationally inexpensive method improves performance under non-stationarity while reducing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference
