SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li, Romit Maulik

TL;DR
SALSA-RL introduces a framework that models RL actions in a latent space, enabling interpretability through local stability analysis without affecting agent performance.
Contribution
It presents a novel method for analyzing RL actions' stability in a latent space, enhancing interpretability and safety assessment of RL agents.
Findings
SALSA-RL can predict action stability before execution.
The method is non-invasive and preserves RL performance.
Applicable across diverse benchmark environments.
Abstract
Modern deep reinforcement learning (DRL) methods have made significant advances in handling continuous action spaces. However, real-world control systems, especially those requiring precise and reliable performance, often demand interpretability in the sense of a-priori assessments of agent behavior to identify safe or failure-prone interactions with environments. To address this limitation, this work proposes SALSA-RL (Stability Analysis in the Latent Space of Actions), a novel RL framework that models control actions as dynamic, time-dependent variables evolving within a latent space. By employing a pre-trained encoder-decoder and a state-dependent linear system, this approach enables interpretability through local stability analysis, where instantaneous growth in action-norms can be predicted before their execution. It is demonstrated that SALSA-RL can be deployed in a non-invasive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
