On the Generalization Error Bounds of Neural Networks under Diversity-Inducing Mutual Angular Regularization
Pengtao Xie, Yuntian Deng, Eric Xing

TL;DR
This paper analyzes how mutual angular regularization influences the generalization error of neural networks, showing that increased diversity among hidden units can reduce estimation error but may raise approximation error, supported by both theory and experiments.
Contribution
It provides the first rigorous theoretical analysis of mutual angular regularization's impact on neural network generalization, combining theory with empirical validation.
Findings
Increasing diversity reduces estimation error.
Increasing diversity raises approximation error.
Empirical results confirm theoretical predictions.
Abstract
Recently diversity-inducing regularization methods for latent variable models (LVMs), which encourage the components in LVMs to be diverse, have been studied to address several issues involved in latent variable modeling: (1) how to capture long-tail patterns underlying data; (2) how to reduce model complexity without sacrificing expressivity; (3) how to improve the interpretability of learned patterns. While the effectiveness of diversity-inducing regularizers such as the mutual angular regularizer has been demonstrated empirically, a rigorous theoretical analysis of them is still missing. In this paper, we aim to bridge this gap and analyze how the mutual angular regularizer (MAR) affects the generalization performance of supervised LVMs. We use neural network (NN) as a model instance to carry out the study and the analysis shows that increasing the diversity of hidden units in NN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Morphological variations and asymmetry
MethodsInterpretability
