Estimating Mutual Information for Discrete-Continuous Mixtures
Weihao Gao, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

TL;DR
This paper introduces a new estimator for mutual information in discrete-continuous mixture spaces, addressing limitations of existing methods and enabling more accurate analysis in complex real-world data scenarios.
Contribution
The paper proposes a novel, consistent mutual information estimator specifically designed for discrete-continuous mixtures, expanding applicability beyond traditional purely discrete or continuous estimators.
Findings
Estimator is consistent and theoretically sound.
Numerical experiments show superiority over existing heuristics.
Method broadens mutual information estimation to mixed data types.
Abstract
Estimating mutual information from observed samples is a basic primitive, useful in several machine learning tasks including correlation mining, information bottleneck clustering, learning a Chow-Liu tree, and conditional independence testing in (causal) graphical models. While mutual information is a well-defined quantity in general probability spaces, existing estimators can only handle two special cases of purely discrete or purely continuous pairs of random variables. The main challenge is that these methods first estimate the (differential) entropies of X, Y and the pair (X;Y) and add them up with appropriate signs to get an estimate of the mutual information. These 3H-estimators cannot be applied in general mixture spaces, where entropy is not well-defined. In this paper, we design a novel estimator for mutual information of discrete-continuous mixtures. We prove that the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Neural Networks and Applications · Distributed Sensor Networks and Detection Algorithms
