A Metric Between Probability Distributions on Finite Sets of Different Cardinalities and Applications to Order Reduction
Mathukumalli Vidyasagar

TL;DR
This paper introduces a new metric for measuring the distance between probability distributions on finite sets of different sizes, with applications to data compression and order reduction in stochastic processes.
Contribution
It defines a novel entropy-based metric for distributions on different finite sets and provides greedy algorithms for its computation and approximation, addressing NP-hard challenges.
Findings
The metric is NP-hard to compute exactly.
Greedy algorithms effectively approximate the metric.
Enables reduction of process order while preserving fidelity.
Abstract
With increasing use of digital control it is natural to view control inputs and outputs as stochastic processes assuming values over finite alphabets rather than in a Euclidean space. As control over networks becomes increasingly common, data compression by reducing the size of the input and output alphabets without losing the fidelity of representation becomes relevant. This requires us to define a notion of distance between two stochastic processes assuming values in distinct sets, possibly of different cardinalities. If the two processes are i.i.d., then the problem becomes one of defining a metric between two probability distributions over distinct finite sets of possibly different cardinalities. This is the problem addressed in the present paper. A metric is defined in terms of a joint distribution on the product of the two sets, which has the two given distributions as its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
