A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes

Thomas J. Ringstrom; Paul R. Schrater

arXiv:2506.09499 · cs.LG · June 12, 2025

Open Access

TL;DR

This paper introduces Option Kernel Bellman Equations (OKBEs), a novel framework for goal-oriented, interpretable, and compositional planning in high-dimensional Markov Decision Processes, emphasizing modularity over reward maximization.

Contribution

The paper proposes OKBEs, which directly construct and optimize state-time option kernels (STOKs): interpretable, compositional initiation-to-termination transition kernels for policies. This enables scalable, goal-based planning in a reward-free setting.

Findings

01. State-time option kernels (STOKs) enable compositional, modular, and interpretable policy representations.

02. High-dimensional STOKs can be represented and computed efficiently in a factorized form.

03. OKBEs support verifiable long-horizon planning and intrinsic motivation.

Abstract

We introduce Option Kernel Bellman Equations (OKBEs) for a new reward-free Markov Decision Process. Rather than a value function, OKBEs directly construct and optimize a predictive map called a state-time option kernel (STOK) to maximize the probability of completing a goal while avoiding constraint violations. STOKs are compositional, modular, and interpretable initiation-to-termination transition kernels for policies in the Options Framework of Reinforcement Learning. This means: 1) STOKs can be composed using Chapman-Kolmogorov equations to make spatiotemporal predictions for multiple policies over long horizons, 2) high-dimensional STOKs can be represented and computed efficiently in a factorized and reconfigurable form, and 3) STOKs record the probabilities of semantically interpretable goal-success and constraint-violation events, needed for formal verification. Given a…
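
The Chapman-Kolmogorov composition mentioned in point 1 can be illustrated with a toy sketch. This is not the paper's OKBE algorithm. It assumes a finite state space, kernels that depend only on elapsed duration, and a dictionary representation; the function `compose`, the kernels `k_a` and `k_b`, and the state names are all hypothetical. Probability mass missing from a kernel row stands in for failure, such as a constraint violation.

```python
from collections import defaultdict


def compose(k1, k2):
    """Chain two toy option kernels with a Chapman-Kolmogorov sum.

    Each kernel maps a start state to {(end_state, duration): probability}.
    Rows may sum to less than 1; the missing mass is treated as failure.
    """
    out = {}
    for start, first in k1.items():
        acc = defaultdict(float)
        for (mid, t1), p1 in first.items():
            # Sum over every intermediate state and time at which the first
            # policy can terminate, then continue with the second policy.
            for (end, t2), p2 in k2.get(mid, {}).items():
                acc[(end, t1 + t2)] += p1 * p2
        out[start] = dict(acc)
    return out


# Hypothetical kernels: option A goes from "start" to "door" (fails with
# probability 0.25), option B goes from "door" to "goal".
k_a = {"start": {("door", 2): 0.5, ("door", 3): 0.25}}
k_b = {"door": {("goal", 1): 0.5, ("goal", 2): 0.5}}

k_ab = compose(k_a, k_b)
success = sum(k_ab["start"].values())  # probability of reaching "goal" at any time
print(k_ab["start"], success)
```

The composed kernel `k_ab` has the same shape as its inputs, so it can be chained with further kernels to predict where and when a longer sequence of policies terminates.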

Taxonomy

Topics: Reinforcement Learning in Robotics · AI-based Problem Solving and Planning · Embodied and Extended Cognition