A Generalized Framework for Kullback-Leibler Markov Aggregation
Rana Ali Amjad, Clemens Bl\"ochl, Bernhard C. Geiger

TL;DR
This paper introduces a unified information-theoretic framework for Markov chain aggregation that generalizes previous methods, balancing Markovianity and temporal dependence preservation, with practical heuristics demonstrated on synthetic data.
Contribution
It presents a generalized cost function for Markov aggregation that encompasses existing approaches and analyzes their limitations, offering new insights and optimization heuristics.
Findings
The proposed cost function includes previous methods as special cases.
Geiger et al.'s cost function can lead to trivial solutions without regularization.
The heuristic effectively optimizes the cost function on synthetic examples.
Abstract
This paper proposes an information-theoretic cost function for aggregating a Markov chain via a (possibly stochastic) mapping. The cost function is motivated by two objectives: 1) The process obtained by observing the Markov chain through the mapping should be close to a Markov chain, and 2) the aggregated Markov chain should retain as much of the temporal dependence structure of the original Markov chain as possible. We discuss properties of this parameterized cost function and show that it contains the cost functions previously proposed by Deng et al., Xu et al., and Geiger et al. as special cases. We moreover discuss these special cases providing a better understanding and highlighting potential shortcomings: For example, the cost function proposed by Geiger et al. is tightly connected to approximate probabilistic bisimulation, but leads to trivial solutions if optimized without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
