Mutual Information Estimation via $f$-Divergence and Data Derangements
Nunzio A. Letizia, Nicola Novello, Andrea M. Tonello

TL;DR
This paper introduces a new mutual information estimator based on $f$-divergence and derangements, achieving better bias-variance trade-off, higher accuracy, and lower complexity compared to existing neural methods.
Contribution
It proposes a novel discriminative mutual information estimator using $f$-divergence and derangements, improving accuracy and efficiency over prior neural estimators.
Findings
Higher accuracy than state-of-the-art methods
Lower complexity in estimation process
Better bias-variance trade-off
Abstract
Estimating mutual information accurately is pivotal across diverse applications, from machine learning to communications and biology, enabling us to gain insights into the inner mechanisms of complex systems. Yet, dealing with high-dimensional data presents a formidable challenge, due to its size and the presence of intricate relationships. Recently proposed neural methods employing variational lower bounds on the mutual information have gained prominence. However, these approaches suffer from either high bias or high variance, as the sample size and the structure of the loss function directly influence the training process. In this paper, we propose a novel class of discriminative mutual information estimators based on the variational representation of the -divergence. We investigate the impact of the permutation function used to obtain the marginal training samples and present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Anomaly Detection Techniques and Applications
