Mutual Information Multinomial Estimation
Yanzhi Chen, Zijing Ou, Adrian Weller, Yingzhen Li

TL;DR
This paper introduces a novel mutual information estimator that leverages a preliminary data distribution estimate to improve accuracy, demonstrated through synthetic and real-world experiments.
Contribution
It presents a new mutual information estimator that uses a preliminary distribution estimate to effectively bridge joint and marginal distributions, enhancing estimation accuracy.
Findings
Effective in non-Gaussian synthetic problems with known ground-truth
Demonstrates advantages in real-world applications
Outperforms existing MI estimation methods
Abstract
Estimating mutual information (MI) is a fundamental yet challenging task in data science and machine learning. This work proposes a new estimator for mutual information. Our main discovery is that a preliminary estimate of the data distribution can dramatically help estimate. This preliminary estimate serves as a bridge between the joint and the marginal distribution, and by comparing with this bridge distribution we can easily obtain the true difference between the joint distributions and the marginal distributions. Experiments on diverse tasks including non-Gaussian synthetic problems with known ground-truth and real-world applications demonstrate the advantages of our method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Statistical Methods and Inference
