Comparing weighted difference and earth mover's distance via Young diagrams
William Q. Erickson

TL;DR
This paper investigates the relationship between two histogram distance metrics, weighted difference and earth mover's distance, deriving formulas for their probability and expectation using Young diagrams, and extends the analysis to multiple histograms.
Contribution
It introduces a combinatorial approach using Young diagrams to analyze the probability and expected value of the earth mover's distance equaling the weighted difference, and generalizes to multiple histograms.
Findings
Derived a formula for the probability that EMD equals the absolute difference.
Calculated the expected value of the weighted difference between histograms.
Extended the analysis to multiple histograms using root lattice geometry.
Abstract
We consider two natural statistics on pairs of histograms, in which the bins have weights . The difference () between the weighted totals of the histograms is, in a sense, refined by the earth mover's distance (), which measures the amount of work required to equalize the histograms. We were recently surprised, however, by how little actually does refine in certain real-world applications, which led to the main problem in this paper: what is the probability that ? We derive a formula for this probability, as well as the expected value of , via the combinatorics of Young diagrams and plane partitions. We then generalize our results to an arbitrary number of histograms, where we realize this higher-dimensional as distance on the Type-A root lattice.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Combinatorial Mathematics · Bayesian Methods and Mixture Models · Advanced Mathematical Identities
