REMEDI: Corrective Transformations for Improved Neural Entropy Estimation
Viktor Nilsson, Anirban Samaddar, Sandeep Madireddy, Pierre Nyquist

TL;DR
REMEDI is a novel method for more accurate and efficient differential entropy estimation that leverages adaptive base models and theoretical insights, improving performance in high-dimensional data scenarios and supervised learning.
Contribution
The paper introduces REMEDI, a new approach combining cross-entropy minimization and deviation estimation, extending theoretical guarantees and applying to supervised learning and generative modeling.
Findings
Improves entropy estimation accuracy on synthetic and natural data.
Enhances the Information Bottleneck method with better entropy estimates.
Establishes theoretical consistency for the proposed estimation framework.
Abstract
Information theoretic quantities play a central role in machine learning. The recent surge in the complexity of data and models has increased the demand for accurate estimation of these quantities. However, as the dimension grows the estimation presents significant challenges, with existing methods struggling already in relatively low dimensions. To address this issue, in this work, we introduce for efficient and accurate estimation of differential entropy, a fundamental information theoretic quantity. The approach combines the minimization of the cross-entropy for simple, adaptive base models and the estimation of their deviation, in terms of the relative entropy, from the data density. Our approach demonstrates improvement across a broad spectrum of estimation tasks, encompassing entropy estimation on both synthetic and natural data. Further, we extend important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks
MethodsFocus · Balanced Selection
