DEMI: Discriminative Estimator of Mutual Information
Ruizhi Liao, Daniel Moyer, Polina Golland, William M. Wells

TL;DR
DEMI introduces a classifier-based mutual information estimator that overcomes the limitations of variational methods, providing accurate, low-variance estimates especially useful in high-dimensional data and representation learning.
Contribution
The paper presents a novel mutual information estimator based on classification probabilities, establishing a direct connection to mutual information and surpassing variational approaches in accuracy and stability.
Findings
High accuracy in mutual information estimation demonstrated.
Advantages shown in representation learning tasks.
Theoretical equivalence to variational methods at optimum.
Abstract
Estimating mutual information between continuous random variables is often intractable and extremely challenging for high-dimensional data. Recent progress has leveraged neural networks to optimize variational lower bounds on mutual information. Although showing promise for this difficult problem, the variational methods have been theoretically and empirically proven to have serious statistical limitations: 1) many methods struggle to produce accurate estimates when the underlying mutual information is either low or high; 2) the resulting estimators may suffer from high variance. Our approach is based on training a classifier that provides the probability that a data sample pair is drawn from the joint distribution rather than from the product of its marginal distributions. Moreover, we establish a direct connection between mutual information and the average log odds estimate produced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
