On the Generalization Error Bounds of Neural Networks under   Diversity-Inducing Mutual Angular Regularization

Pengtao Xie; Yuntian Deng; Eric Xing

arXiv:1511.07110·cs.LG·November 24, 2015·23 cites

On the Generalization Error Bounds of Neural Networks under Diversity-Inducing Mutual Angular Regularization

Pengtao Xie, Yuntian Deng, Eric Xing

PDF

Open Access

TL;DR

This paper analyzes how mutual angular regularization influences the generalization error of neural networks, showing that increased diversity among hidden units can reduce estimation error but may raise approximation error, supported by both theory and experiments.

Contribution

It provides the first rigorous theoretical analysis of mutual angular regularization's impact on neural network generalization, combining theory with empirical validation.

Findings

01

Increasing diversity reduces estimation error.

02

Increasing diversity raises approximation error.

03

Empirical results confirm theoretical predictions.

Abstract

Recently diversity-inducing regularization methods for latent variable models (LVMs), which encourage the components in LVMs to be diverse, have been studied to address several issues involved in latent variable modeling: (1) how to capture long-tail patterns underlying data; (2) how to reduce model complexity without sacrificing expressivity; (3) how to improve the interpretability of learned patterns. While the effectiveness of diversity-inducing regularizers such as the mutual angular regularizer has been demonstrated empirically, a rigorous theoretical analysis of them is still missing. In this paper, we aim to bridge this gap and analyze how the mutual angular regularizer (MAR) affects the generalization performance of supervised LVMs. We use neural network (NN) as a model instance to carry out the study and the analysis shows that increasing the diversity of hidden units in NN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Morphological variations and asymmetry

MethodsInterpretability