Towards Quantifying Intrinsic Generalization of Deep ReLU Networks

Shaeke Salman; Canlin Zhang; Xiuwen Liu; Washington Mio

arXiv:1910.08581·cs.LG·October 22, 2019

TL;DR

This paper shows that deep ReLU networks generalize from training samples to new points via piece-wise linear interpolation, and that the same mechanism operates whether the training labels are real or randomized.

Contribution

The paper quantifies the generalization ability of deep ReLU networks using generalization intervals, and compares real-label and random-label training on standard datasets.

Findings

01 Deep ReLU networks generalize through piece-wise linear interpolation.

02 Generalization intervals behave similarly along pairwise directions in the real-label and random-label cases.

03 Networks approximate the data manifold better with real labels, showing smaller changes along tangent directions.
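
The first finding can be illustrated with a minimal sketch. It assumes a hypothetical one-hidden-layer ReLU network with hand-picked weights (`W1`, `b1`, `W2`, `b2` are illustrative values, not taken from the paper). When two inputs share the same activation pattern, the network is affine on the segment between them, so the output at the midpoint equals the average of the outputs at the endpoints.

```python
# Toy demonstration of piece-wise linearity in a ReLU network.
# All weights and inputs below are hypothetical, chosen for illustration only.

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def forward(x, W1, b1, W2, b2):
    """Return (output, activation pattern) of a one-hidden-layer ReLU network."""
    pre = [p + b for p, b in zip(matvec(W1, x), b1)]
    hidden = [max(0.0, p) for p in pre]            # ReLU
    out = [o + b for o, b in zip(matvec(W2, hidden), b2)]
    pattern = tuple(p > 0 for p in pre)            # which hidden units are active
    return out, pattern

W1 = [[1.0, -1.0], [0.5, 1.0], [-1.0, 0.5]]
b1 = [0.0, -0.5, 0.25]
W2 = [[1.0, -2.0, 0.5], [-1.0, 1.0, 1.0]]
b2 = [0.1, -0.1]

a = [1.0, 0.5]
b = [2.0, 1.0]
mid = [(ai + bi) / 2 for ai, bi in zip(a, b)]

out_a, pat_a = forward(a, W1, b1, W2, b2)
out_b, pat_b = forward(b, W1, b1, W2, b2)
out_mid, pat_mid = forward(mid, W1, b1, W2, b2)

# Both endpoints activate the same hidden units, so the network is affine
# on the whole segment and the midpoint output is the endpoint average.
interpolated = [(u + v) / 2 for u, v in zip(out_a, out_b)]
same_region = pat_a == pat_b == pat_mid
max_gap = max(abs(u - v) for u, v in zip(out_mid, interpolated))
print(same_region, max_gap)
```

Between inputs whose activation patterns differ, the segment crosses at least one linear region boundary and the midpoint identity generally no longer holds.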

Abstract

Understanding the underlying mechanisms that enable the empirical successes of deep neural networks is essential for further improving their performance and explaining such networks. Towards this goal, a specific question is how to explain the "surprising" behavior of the same over-parametrized deep neural networks that can generalize well on real datasets and at the same time "memorize" training samples when the labels are randomized. In this paper, we demonstrate that deep ReLU networks generalize from training samples to new points via piece-wise linear interpolation. We provide a quantified analysis of the generalization ability of a deep ReLU network: given a fixed point $x$ and a fixed direction in the input space $S$, there is always a segment such that any point on the segment will be classified the same as the fixed point $x$. We call this segment…
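
The segment described in the abstract can be estimated numerically. The sketch below is one possible approach, not the authors' procedure (the excerpt does not state how they compute it): it samples along the direction until the predicted class changes, then refines the crossing by bisection. The names `boundary_distance`, `class_preserving_segment`, and `toy_classifier`, as well as the parameters `t_max`, `steps`, and `tol`, are hypothetical, and the sketch assumes the segment contains $x$ and is measured in both senses of the direction.

```python
# Estimate the segment around x, along direction d, on which a classifier
# keeps predicting the same class as at x. Illustrative sketch only.

def boundary_distance(classify, x, d, t_max=10.0, steps=1000, tol=1e-6):
    """Approximate the largest t in [0, t_max] with an unchanged class along x + t*d."""
    base = classify(x)

    def point(t):
        return [xi + t * di for xi, di in zip(x, d)]

    lo = 0.0
    for k in range(1, steps + 1):
        t = t_max * k / steps
        if classify(point(t)) != base:
            hi = t
            # Bisection between the last agreeing sample and the first disagreeing one.
            while hi - lo > tol:
                mid = (lo + hi) / 2
                if classify(point(mid)) == base:
                    lo = mid
                else:
                    hi = mid
            return lo
        lo = t
    return t_max  # no class change found within the search range

def class_preserving_segment(classify, x, d, **kwargs):
    """Return (t_minus, t_plus): the class at x + t*d matches x for t in [t_minus, t_plus]."""
    neg_d = [-di for di in d]
    return (-boundary_distance(classify, x, neg_d, **kwargs),
            boundary_distance(classify, x, d, **kwargs))

# Hypothetical two-class rule standing in for a trained network.
def toy_classifier(p):
    return 1 if p[0] + p[1] > 1.0 else 0

t_minus, t_plus = class_preserving_segment(toy_classifier, [0.0, 0.0], [1.0, 0.0])
print(t_minus, t_plus)
```

Because the search samples at a fixed step, it can miss a class change narrower than `t_max / steps`; a finer grid trades run time for reliability. A result equal to `t_max` only means no change was found within the search range.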

Taxonomy

Topics: Machine Learning and Data Classification · Neural Networks and Applications · Generative Adversarial Networks and Image Synthesis