Large Neural Networks Learning from Scratch with Very Few Data and   without Explicit Regularization

Christoph Linse; Thomas Martinetz

arXiv:2205.08836·cs.CV·October 24, 2022·1 cites

Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization

Christoph Linse, Thomas Martinetz

PDF

Open Access

TL;DR

Highly over-parameterized neural networks can learn effectively from very few data points without explicit regularization or pretraining, challenging traditional machine learning assumptions.

Contribution

This study empirically demonstrates that large CNNs can learn from limited data without regularization across multiple fine-grained classification datasets.

Findings

01

Large CNNs achieve high accuracy with minimal data without explicit regularization.

02

Over-parameterized models memorize training data yet generalize well in practice.

03

Randomly initialized VGG19 can distinguish classes with high accuracy using few samples.

Abstract

Recent findings have shown that highly over-parameterized Neural Networks generalize without pretraining or explicit regularization. It is achieved with zero training error, i.e., complete over-fitting by memorizing the training data. This is surprising, since it is completely against traditional machine learning wisdom. In our empirical study we fortify these findings in the domain of fine-grained image classification. We show that very large Convolutional Neural Networks with millions of weights do learn with only a handful of training samples and without image augmentation, explicit regularization or pretraining. We train the architectures ResNet018, ResNet101 and VGG19 on subsets of the difficult benchmark datasets Caltech101, CUB_200_2011, FGVCAircraft, Flowers102 and StanfordCars with 100 classes and more, perform a comprehensive comparative study and draw implications for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning