Label-Based Diversity Measure Among Hidden Units of Deep Neural Networks: A Regularization Method
Chenguang Zhang, Yuexian Hou, Dawei Song, Liangzhu Ge and, Yaoshuai Yao

TL;DR
This paper introduces an information-theoretic measure of diversity among hidden units in deep neural networks, linking it to generalization capacity and proposing a regularization method to reduce overfitting.
Contribution
It formalizes a new redundancy-based diversity measure rooted in mutual information and demonstrates its effectiveness as a regularizer for improving generalization.
Findings
Redundancy reduction improves generalization capacity.
Regularization with the redundancy measure reduces overfitting.
Theoretical proof links lower redundancy to better generalization.
Abstract
Although the deep structure guarantees the powerful expressivity of deep networks (DNNs), it also triggers serious overfitting problem. To improve the generalization capacity of DNNs, many strategies were developed to improve the diversity among hidden units. However, most of these strategies are empirical and heuristic in absence of either a theoretical derivation of the diversity measure or a clear connection from the diversity to the generalization capacity. In this paper, from an information theoretic perspective, we introduce a new definition of redundancy to describe the diversity of hidden units under supervised learning settings by formalizing the effect of hidden layers on the generalization capacity as the mutual information. We prove an opposite relationship existing between the defined redundancy and the generalization capacity, i.e., the decrease of redundancy generally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Face and Expression Recognition
