Investigation of Alternative Measures for Mutual Information
Bulut Kuskonmaz, Jaron Skovsted Gundersen, Rafal Wisniewski

TL;DR
This paper evaluates alternative divergence measures like Wasserstein and Jensen-Shannon as substitutes for mutual information in continuous variables, analyzing their estimation methods and effectiveness.
Contribution
It introduces and compares various divergence-based measures as alternatives to mutual information for continuous variables, with a focus on their estimation and performance.
Findings
Wasserstein distance provides a robust alternative to KL-divergence.
Jensen-Shannon divergence offers a bounded measure suitable for continuous data.
Different estimation methods vary in accuracy and computational efficiency.
Abstract
Mutual information is a useful definition in information theory to estimate how much information the random variable holds about the random variable . One way to define the mutual information is by comparing the joint distribution of and with the product of the marginals through the KL-divergence. If the two distributions are close to each other there will be almost no leakage of from since the two variables are close to being independent. In the discrete setting the mutual information has the nice interpretation of how many bits reveals about and if (the Shannon entropy of ) then is completely revealed. However, in the continuous case we do not have the same reasoning. For instance the mutual information can be infinite in the continuous case. This fact enables us to try different metrics or divergences to define the mutual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Cognitive Science and Education Research · Wireless Communication Security Techniques
