Constructing Non-Markovian Decision Process via History Aggregator

Yongyi Wang; Wenxin Li

arXiv:2506.24026·cs.AI·July 1, 2025

Constructing Non-Markovian Decision Process via History Aggregator

Yongyi Wang, Wenxin Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a category-theoretic framework and a History Aggregator for State (HAS) to model, analyze, and evaluate decision-making processes with non-Markovian dynamics, addressing a key gap in existing benchmarks.

Contribution

It establishes a theoretical equivalence between Markov and non-Markovian decision processes and proposes HAS to control and incorporate non-Markovianity in decision-making settings.

Findings

01

Theoretical foundation for non-Markovian decision processes via category theory.

02

Effective representation of diverse non-Markovian dynamics.

03

Enhanced evaluation of decision algorithms in non-Markovian environments.

Abstract

In the domain of algorithmic decision-making, non-Markovian dynamics manifest as a significant impediment, especially for paradigms such as Reinforcement Learning (RL), thereby exerting far-reaching consequences on the advancement and effectiveness of the associated systems. Nevertheless, the existing benchmarks are deficient in comprehensively assessing the capacity of decision algorithms to handle non-Markovian dynamics. To address this deficiency, we have devised a generalized methodology grounded in category theory. Notably, we established the category of Markov Decision Processes (MDP) and the category of non-Markovian Decision Processes (NMDP), and proved the equivalence relationship between them. This theoretical foundation provides a novel perspective for understanding and addressing non-Markovian dynamics. We further introduced non-Markovianity into decision-making problem…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

2664139264/sequential_rl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Embodied and Extended Cognition · Machine Learning and Algorithms