Towards Quantifying Intrinsic Generalization of Deep ReLU Networks
Shaeke Salman, Canlin Zhang, Xiuwen Liu, Washington Mio

TL;DR
This paper investigates how deep ReLU networks generalize by piece-wise linear interpolation, revealing similar mechanisms in real and random label cases and providing insights into their generalization behavior.
Contribution
It offers a quantified analysis of deep ReLU networks' generalization via generalization intervals, comparing real and random label scenarios on standard datasets.
Findings
Deep ReLU networks generalize through piece-wise linear interpolation.
Generalization intervals behave similarly along pairwise directions in real and random cases.
Networks approximate the data manifold better with real labels, showing smaller changes along tangent directions.
Abstract
Understanding the underlying mechanisms that enable the empirical successes of deep neural networks is essential for further improving their performance and explaining such networks. Towards this goal, a specific question is how to explain the "surprising" behavior of the same over-parametrized deep neural networks that can generalize well on real datasets and at the same time "memorize" training samples when the labels are randomized. In this paper, we demonstrate that deep ReLU networks generalize from training samples to new points via piece-wise linear interpolation. We provide a quantified analysis on the generalization ability of a deep ReLU network: Given a fixed point and a fixed direction in the input space , there is always a segment such that any point on the segment will be classified the same as the fixed point . We call this segment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Generative Adversarial Networks and Image Synthesis
Methods*Communicated@Fast*How Do I Communicate to Expedia?
