Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers
Leonardo Guiducci, Antonio Rizzo, Giovanna Maria Dimitri

TL;DR
This paper introduces a post-hoc explainability framework for Elastic Decision Transformers in offline reinforcement learning, revealing how intrinsic motivation influences learned representations and improves policy performance.
Contribution
It provides a systematic analysis of how intrinsic motivation mechanisms shape embedding structures in EDTs, uncovering environment-specific representational patterns.
Findings
Intrinsic motivation creates distinct embedding structures.
Embedding metrics correlate with performance in environment-specific ways.
Intrinsic motivation acts as a representational prior shaping decision-making.
Abstract
Elastic Decision Transformers (EDTs) have proved to be particularly successful in offline reinforcement learning, offering a flexible framework that unifies sequence modeling with decision-making under uncertainty. Recent research has shown that incorporating intrinsic motivation mechanisms into EDTs improves performance across exploration tasks, yet the representational mechanisms underlying these improvements remain unexplored. In this paper, we introduce a systematic post-hoc explainability framework to analyze how intrinsic motivation shapes learned embeddings in EDTs. Through statistical analysis of embedding properties (including covariance structure, vector magnitudes, and orthogonality), we reveal that different intrinsic motivation variants create fundamentally different representational structures. Our analysis demonstrates environment-specific correlation patterns between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Information and Cyber Security
