On the Estimation of Mutual Information
Nicholas Carrara, Jesse Ernst

TL;DR
This paper examines the robustness of mutual information estimators from finite samples, emphasizing their invariance under transformations relevant to machine learning, and evaluates existing non-parametric estimators like Kraskov's for these criteria.
Contribution
It introduces a framework to assess the robustness of mutual information estimators under various transformations and analyzes the performance of existing estimators within this framework.
Findings
Kraskov's estimator shows high robustness under coordinate transformations.
Existing estimators vary significantly in their invariance properties.
Robust estimators can better inform transformation choices in machine learning.
Abstract
In this paper we focus on the estimation of mutual information from finite samples . The main concern with estimations of mutual information is their robustness under the class of transformations for which it remains invariant: i.e. type I (coordinate transformations), III (marginalizations) and special cases of type IV (embeddings, products). Estimators which fail to meet these standards are not \textit{robust} in their general applicability. Since most machine learning tasks employ transformations which belong to the classes referenced in part I, the mutual information can tell us which transformations are most optimal\cite{Carrara_Ernst}. There are several classes of estimation methods in the literature, such as non-parametric estimators like the one developed by Kraskov et. al\cite{KSG}, and its improved versions\cite{LNC}. These estimators are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Neural Networks and Applications · Fault Detection and Control Systems
