Convolutional Neural Networks with Dynamic Regularization

Yi Wang; Zhen-Peng Bian; Junhui Hou; Lap-Pui Chau

arXiv:1909.11862·cs.CV·January 1, 2021·5 cites

Convolutional Neural Networks with Dynamic Regularization

Yi Wang, Zhen-Peng Bian, Junhui Hou, Lap-Pui Chau

PDF

Open Access

TL;DR

This paper introduces a dynamic regularization technique for CNNs that adjusts regularization strength based on training loss, improving generalization without manual tuning.

Contribution

It presents a novel adaptive regularization method that automatically balances overfitting and underfitting during CNN training.

Findings

01

Outperforms existing regularization methods on standard architectures

02

Automatically adjusts regularization strength based on training loss

03

Enhances model generalization capabilities

Abstract

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization performance. However, these methods lack a self-adaptive ability throughout training. That is, the regularization strength is fixed to a predefined schedule, and manual adjustments are required to adapt to various network architectures. In this paper, we propose a dynamic regularization method for CNNs. Specifically, we model the regularization strength as a function of the training loss. According to the change of the training loss, our method can dynamically adjust the regularization strength in the training procedure, thereby balancing the underfitting and overfitting of CNNs. With dynamic regularization, a large-scale model is automatically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsDropBlock