SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning

Xuyang Li; Romit Maulik

arXiv:2502.15512 · cs.LG · April 10, 2026

TL;DR

SALSA-RL introduces a framework that models RL actions in a latent space, enabling interpretability through local stability analysis without affecting agent performance.

Contribution

The paper presents a method for analyzing the stability of RL actions in a latent space, which supports interpretability and safety assessment of RL agents.

Findings

01. SALSA-RL can predict action stability before execution.

02. The method is non-invasive and preserves RL performance.

03. The method is applicable across diverse benchmark environments.

Abstract

Modern deep reinforcement learning (DRL) methods have made significant advances in handling continuous action spaces. However, real-world control systems, especially those requiring precise and reliable performance, often demand interpretability in the sense of a-priori assessments of agent behavior to identify safe or failure-prone interactions with environments. To address this limitation, this work proposes SALSA-RL (Stability Analysis in the Latent Space of Actions), a novel RL framework that models control actions as dynamic, time-dependent variables evolving within a latent space. By employing a pre-trained encoder-decoder and a state-dependent linear system, this approach enables interpretability through local stability analysis, where instantaneous growth in action-norms can be predicted before their execution. It is demonstrated that SALSA-RL can be deployed in a non-invasive…
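
The idea of local stability analysis on a state-dependent linear system can be illustrated with a small sketch. It is not the authors' implementation: it assumes a discrete-time latent update of the form z_{t+1} = A(s_t) z_t, and the function `state_dependent_A` is a hypothetical toy stand-in for whatever learned network produces the matrix. Under those assumptions, the eigenvalues of A(s_t) indicate whether latent actions grow or decay over repeated steps, and the largest singular value bounds the growth of the latent norm in a single step, both of which can be computed before any action is executed.

```python
import numpy as np


def state_dependent_A(state):
    """Hypothetical toy map from an environment state to a 2x2 latent matrix.

    In a trained model this would be a learned network; here it is a
    hand-written function chosen only so that stability varies with the state.
    """
    damping = 0.5 + 0.7 * np.tanh(abs(state[0]))
    return np.array([[damping, 0.1],
                     [0.0, 0.5 * damping]])


def latent_step(A, z):
    """One step of the assumed discrete-time latent dynamics z_next = A z."""
    return A @ z


def stability_report(A):
    """Summarize local stability of the linear map A.

    spectral_radius < 1 means repeated application of A shrinks z.
    max_one_step_gain (the spectral norm) bounds ||A z|| / ||z|| for one step.
    """
    eigvals = np.linalg.eigvals(A)
    spectral_radius = float(np.max(np.abs(eigvals)))
    max_one_step_gain = float(np.linalg.norm(A, 2))
    return {
        "spectral_radius": spectral_radius,
        "max_one_step_gain": max_one_step_gain,
        "asymptotically_stable": spectral_radius < 1.0,
    }


if __name__ == "__main__":
    z = np.array([1.0, 1.0])
    for s in (np.array([0.0]), np.array([2.0])):
        A = state_dependent_A(s)
        report = stability_report(A)
        z_next = latent_step(A, z)
        print(s, report, np.linalg.norm(z_next) / np.linalg.norm(z))
```

In this toy setting the state near zero yields a contracting matrix and the larger state yields an expanding one, so the check flags the second case before the latent action is decoded. A continuous-time formulation would instead inspect the sign of the real parts of the eigenvalues.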
