A Differential Entropy Estimator for Training Neural Networks
Georg Pichler, Pierre Colombo, Malik Boudiaf, G\"unther, Koliander, Pablo Piantanida

TL;DR
This paper introduces KNIFE, a novel differentiable kernel-based estimator for differential entropy, enabling its use as a regularizer in neural network training across various high-dimensional tasks.
Contribution
The paper presents KNIFE, a fully parameterized, differentiable estimator for differential entropy and mutual information, overcoming previous limitations and broadening their application in neural network training.
Findings
KNIFE effectively estimates differential entropy in high-dimensional data.
Using KNIFE improves neural network training in tasks like domain adaptation and fair classification.
The method is validated on synthetic and real-world datasets, demonstrating versatility and accuracy.
Abstract
Mutual Information (MI) has been widely used as a loss regularizer for training neural networks. This has been particularly effective when learn disentangled or compressed representations of high dimensional data. However, differential entropy (DE), another fundamental measure of information, has not found widespread use in neural network training. Although DE offers a potentially wider range of applications than MI, off-the-shelf DE estimators are either non differentiable, computationally intractable or fail to adapt to changes in the underlying distribution. These drawbacks prevent them from being used as regularizers in neural networks training. To address shortcomings in previously proposed estimators for DE, here we introduce KNIFE, a fully parameterized, differentiable kernel-based estimator of DE. The flexibility of our approach also allows us to construct KNIFE-based estimators…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
MethodsNetwork On Network
