Know your Trajectory -- Trustworthy Reinforcement Learning deployment through Importance-Based Trajectory Analysis

Clifford F; Devika Jay; Abhishek Sarkar; Satheesh K Perepu; Santhosh G S; Kaushik Dey; Balaraman Ravindran

arXiv:2512.06917·cs.LG·December 9, 2025

Know your Trajectory -- Trustworthy Reinforcement Learning deployment through Importance-Based Trajectory Analysis

Clifford F, Devika Jay, Abhishek Sarkar, Satheesh K Perepu, Santhosh G S, Kaushik Dey, Balaraman Ravindran

PDF

Open Access

TL;DR

This paper presents a trajectory-level explainability framework for reinforcement learning that ranks entire trajectories using a novel importance metric, enhancing transparency and trustworthiness of RL agents in real-world applications.

Contribution

Introduces a new importance-based trajectory ranking method that captures long-term behavior and provides counterfactual explanations for RL agents.

Findings

01

Effectively identifies optimal trajectories in standard environments.

02

Outperforms classic methods in trajectory importance assessment.

03

Provides robust explanations for agent decisions.

Abstract

As Reinforcement Learning (RL) agents are increasingly deployed in real-world applications, ensuring their behavior is transparent and trustworthy is paramount. A key component of trust is explainability, yet much of the work in Explainable RL (XRL) focuses on local, single-step decisions. This paper addresses the critical need for explaining an agent's long-term behavior through trajectory-level analysis. We introduce a novel framework that ranks entire trajectories by defining and aggregating a new state-importance metric. This metric combines the classic Q-value difference with a "radical term" that captures the agent's affinity to reach its goal, providing a more nuanced measure of state criticality. We demonstrate that our method successfully identifies optimal trajectories from a heterogeneous collection of agent experiences. Furthermore, by generating counterfactual rollouts from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics