AdaCap: Adaptive Capacity control for Feed-Forward Neural Networks

Katia Meziani; Karim Lounici; Benjamin Riu

arXiv:2205.07860·cs.LG·May 18, 2022

AdaCap: Adaptive Capacity control for Feed-Forward Neural Networks

Katia Meziani, Karim Lounici, Benjamin Riu

PDF

Open Access

TL;DR

AdaCap introduces a novel training scheme for Feed-Forward Neural Networks that adaptively controls their capacity to learn high-level representations without overfitting, using innovative loss functions and training schemes.

Contribution

The paper proposes AdaCap, combining MLR loss and Tikhonov operator training to adaptively regulate neural network capacity, reducing memorization and improving generalization.

Findings

01

AdaCap effectively reduces memorization on small datasets.

02

It outperforms traditional methods in generalization performance.

03

The MLR loss accurately estimates out-of-sample performance.

Abstract

The capacity of a ML model refers to the range of functions this model can approximate. It impacts both the complexity of the patterns a model can learn but also memorization, the ability of a model to fit arbitrary labels. We propose Adaptive Capacity (AdaCap), a training scheme for Feed-Forward Neural Networks (FFNN). AdaCap optimizes the capacity of FFNN so it can capture the high-level abstract representations underlying the problem at hand without memorizing the training dataset. AdaCap is the combination of two novel ingredients, the Muddling labels for Regularization (MLR) loss and the Tikhonov operator training scheme. The MLR loss leverages randomly generated labels to quantify the propensity of a model to memorize. We prove that the MLR loss is an accurate in-sample estimator for out-of-sample generalization performance and that it can be used to perform Hyper-Parameter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM