Regularisation in neural networks: a survey and empirical analysis of approaches
Christiaan P. Opperman, Anna S. Bosman, and Katherine M. Malan

TL;DR
This paper reviews various regularisation techniques for neural networks, categorizes them, and empirically tests their effectiveness across multiple datasets and architectures, revealing dataset-dependent results.
Contribution
It provides a comprehensive taxonomy of regularisation methods and empirically evaluates their practical impact, highlighting when and where they improve neural network performance.
Findings
Regularisation effectiveness varies by dataset type.
Batch normalization benefits image datasets.
Regularisation improves numeric datasets performance.
Abstract
Despite huge successes on a wide range of tasks, neural networks are known to sometimes struggle to generalise to unseen data. Many approaches have been proposed over the years to promote the generalisation ability of neural networks, collectively known as regularisation techniques. These are used as common practice under the assumption that any regularisation added to the pipeline would result in a performance improvement. In this study, we investigate whether this assumption holds in practice. First, we provide a broad review of regularisation techniques, including modern theories such as double descent. We propose a taxonomy of methods under four broad categories, namely: (1) data-based strategies, (2) architecture strategies, (3) training strategies, and (4) loss function strategies. Notably, we highlight the contradictions and correspondences between the approaches in these broad…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
