Distances for Markov chains from sample streams
Sergio Calo, Anders Jonsson, Gergely Neu, Ludovic Schwartz, Javier Segovia-Aguas

TL;DR
This paper introduces a stochastic optimization approach to estimate bisimulation metrics for Markov chains using only sample trajectories, removing the need for explicit transition models and enabling practical applications.
Contribution
It presents a new LP formulation and a stochastic primal-dual algorithm for estimating bisimulation metrics from samples, with theoretical guarantees and empirical validation.
Findings
The method accurately estimates bisimulation metrics from sample data.
The approach has provable sample complexity bounds.
Empirical results demonstrate effectiveness on real-world data.
Abstract
Bisimulation metrics are powerful tools for measuring similarities between stochastic processes, and specifically Markov chains. Recent advances have uncovered that bisimulation metrics are, in fact, optimal-transport distances, which has enabled the development of fast algorithms for computing such metrics with provable accuracy and runtime guarantees. However, these recent methods, as well as all previously known methods, assume full knowledge of the transition dynamics. This is often an impractical assumption in most real-world scenarios, where typically only sample trajectories are available. In this work, we propose a stochastic optimization method that addresses this limitation and estimates bisimulation metrics based on sample access, without requiring explicit transition models. Our approach is derived from a new linear programming (LP) formulation of bisimulation metrics, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
