Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers

Leonardo Guiducci; Antonio Rizzo; Giovanna Maria Dimitri

arXiv:2506.13958·cs.LG·November 18, 2025

Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers

Leonardo Guiducci, Antonio Rizzo, Giovanna Maria Dimitri

PDF

Open Access

TL;DR

This paper introduces a post-hoc explainability framework for Elastic Decision Transformers in offline reinforcement learning, revealing how intrinsic motivation influences learned representations and improves policy performance.

Contribution

It provides a systematic analysis of how intrinsic motivation mechanisms shape embedding structures in EDTs, uncovering environment-specific representational patterns.

Findings

01

Intrinsic motivation creates distinct embedding structures.

02

Embedding metrics correlate with performance in environment-specific ways.

03

Intrinsic motivation acts as a representational prior shaping decision-making.

Abstract

Elastic Decision Transformers (EDTs) have proved to be particularly successful in offline reinforcement learning, offering a flexible framework that unifies sequence modeling with decision-making under uncertainty. Recent research has shown that incorporating intrinsic motivation mechanisms into EDTs improves performance across exploration tasks, yet the representational mechanisms underlying these improvements remain unexplored. In this paper, we introduce a systematic post-hoc explainability framework to analyze how intrinsic motivation shapes learned embeddings in EDTs. Through statistical analysis of embedding properties (including covariance structure, vector magnitudes, and orthogonality), we reveal that different intrinsic motivation variants create fundamentally different representational structures. Our analysis demonstrates environment-specific correlation patterns between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBusiness Process Modeling and Analysis · Information and Cyber Security