Generalization Comparison of Deep Neural Networks via Output Sensitivity

Mahsa Forouzesh; Farnood Salehi; Patrick Thiran

arXiv:2007.15378·cs.LG·July 31, 2020

Generalization Comparison of Deep Neural Networks via Output Sensitivity

Mahsa Forouzesh, Farnood Salehi, Patrick Thiran

PDF

1 Repo

TL;DR

This paper investigates the relationship between output sensitivity and generalization in deep neural networks, proposing sensitivity as a label-free metric to compare models' generalization capabilities.

Contribution

It reveals a strong empirical link between output sensitivity and loss variance, and suggests using sensitivity as a new metric for assessing generalization performance.

Findings

01

Sensitivity decreases with techniques that improve generalization

02

Deeper networks and convolutional layers reduce sensitivity

03

Batch normalization, dropout, and initialization also lower sensitivity

Abstract

Although recent works have brought some insights into the performance improvement of techniques used in state-of-the-art deep-learning models, more work is needed to understand their generalization properties. We shed light on this matter by linking the loss function to the output's sensitivity to its input. We find a rather strong empirical relation between the output sensitivity and the variance in the bias-variance decomposition of the loss function, which hints on using sensitivity as a metric for comparing the generalization performance of networks, without requiring labeled data. We find that sensitivity is decreased by applying popular methods which improve the generalization performance of the model, such as (1) using a deep network rather than a wide one, (2) adding convolutional layers to baseline classifiers instead of adding fully-connected layers, (3) using batch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mahf93/sensitivity
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDropout