Regularisation of Neural Networks by Enforcing Lipschitz Continuity

Henry Gouk; Eibe Frank; Bernhard Pfahringer; Michael J. Cree

arXiv:1804.04368·stat.ML·August 11, 2020

Regularisation of Neural Networks by Enforcing Lipschitz Continuity

Henry Gouk, Eibe Frank, Bernhard Pfahringer, Michael J. Cree

PDF

1 Repo 1 Video

TL;DR

This paper introduces a simple method to enforce Lipschitz continuity in neural networks, improving model performance especially with limited data by formulating it as a constrained optimization problem.

Contribution

It provides a straightforward technique to compute Lipschitz bounds for neural networks and integrates this into training as a constrained optimization, outperforming common regularizers.

Findings

01

Models with enforced Lipschitz continuity outperform those with standard regularizers.

02

The method is effective with small training datasets.

03

Hyperparameters are intuitive to tune for the proposed method.

Abstract

We investigate the effect of explicitly enforcing the Lipschitz continuity of neural networks with respect to their inputs. To this end, we provide a simple technique for computing an upper bound to the Lipschitz constant---for multiple $p$ -norms---of a feed forward neural network composed of commonly used layer types. Our technique is then used to formulate training a neural network with a bounded Lipschitz constant as a constrained optimisation problem that can be solved using projected stochastic gradient methods. Our evaluation study shows that the performance of the resulting models exceeds that of models trained with other common regularisers. We also provide evidence that the hyperparameters are intuitive to tune, demonstrate how the choice of norm for computing the Lipschitz constant impacts the resulting model, and show that the performance gains provided by our method are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

henrygouk/keras-lipschitz-networks
noneOfficial

Videos

[Quiz] Regularization in Deep Learning, Lipschitz continuity, Gradient regularization· youtube

Taxonomy

MethodsLipschitz Constant Constraint