Constructing Non-Markovian Decision Process via History Aggregator
Yongyi Wang, Wenxin Li

TL;DR
This paper introduces a category-theoretic framework and a History Aggregator for State (HAS) to model, analyze, and evaluate decision-making processes with non-Markovian dynamics, addressing a key gap in existing benchmarks.
Contribution
It establishes a theoretical equivalence between Markov and non-Markovian decision processes and proposes HAS to control and incorporate non-Markovianity in decision-making settings.
Findings
Theoretical foundation for non-Markovian decision processes via category theory.
Effective representation of diverse non-Markovian dynamics.
Enhanced evaluation of decision algorithms in non-Markovian environments.
Abstract
In the domain of algorithmic decision-making, non-Markovian dynamics manifest as a significant impediment, especially for paradigms such as Reinforcement Learning (RL), thereby exerting far-reaching consequences on the advancement and effectiveness of the associated systems. Nevertheless, the existing benchmarks are deficient in comprehensively assessing the capacity of decision algorithms to handle non-Markovian dynamics. To address this deficiency, we have devised a generalized methodology grounded in category theory. Notably, we established the category of Markov Decision Processes (MDP) and the category of non-Markovian Decision Processes (NMDP), and proved the equivalence relationship between them. This theoretical foundation provides a novel perspective for understanding and addressing non-Markovian dynamics. We further introduced non-Markovianity into decision-making problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Embodied and Extended Cognition · Machine Learning and Algorithms
