Over-parameterization: A Necessary Condition for Models that Extrapolate
Roozbeh Yousefzadeh

TL;DR
This paper investigates the necessity of over-parameterization for models to extrapolate beyond their training data's convex hull, providing theoretical proofs, empirical analysis, and a formalized definition of extrapolation in deep learning.
Contribution
It establishes that over-parameterization is necessary for extrapolation, offers a clearer definition of over-parameterization, and connects deep learning extrapolation to classical mathematical theories.
Findings
Over-parameterization is necessary for controlling domain partitioning outside the convex hull.
Empirical analysis of dataset geometry reveals insights into extrapolation capabilities.
Formalization of extrapolation relates deep learning models to classical mathematical problems.
Abstract
In this work, we study over-parameterization as a necessary condition for having the ability for the models to extrapolate outside the convex hull of training set. We specifically, consider classification models, e.g., image classification and other applications of deep learning. Such models are classification functions that partition their domain and assign a class to each partition \cite{strang2019linear}. Partitions are defined by decision boundaries and so is the classification model/function. Convex hull of training set may occupy only a subset of the domain, but trained model may partition the entire domain and not just the convex hull of training set. This is important because many of the testing samples may be outside the convex hull of training set and the way in which a model partitions its domain outside the convex hull would be influential in its generalization. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
MethodsAverage Pooling · 1x1 Convolution · Bottleneck Residual Block · Kaiming Initialization · Global Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Residual Connection · Convolution · Batch Normalization
