Using UNet and PSPNet to explore the reusability principle of CNN parameters
Wei Wang

TL;DR
This paper investigates the reusability of CNN parameters in deep networks like UNet and PSPNet for segmentation and auto-encoder tasks, revealing insights into which parameters can be effectively reused to reduce training data needs.
Contribution
It experimentally quantifies parameter reusability in CNN layers, highlighting differences between Batch Normalization and Convolution layers, and clarifies reasons behind successful transfer learning.
Findings
Batch normalization running mean and variance are crucial for reusability.
Convolution layer weights are highly sensitive and less reusable.
Biases in convolution layers are easily reusable.
Abstract
How to reduce the requirement on training dataset size is a hot topic in deep learning community. One straightforward way is to reuse some pre-trained parameters. Some previous work like Deep transfer learning reuse the model parameters trained for the first task as the starting point for the second task, and semi-supervised learning is trained upon a combination of labeled and unlabeled data. However, the fundamental reason of the success of these methods is unclear. In this paper, the reusability of parameters in each layer of a deep convolutional neural network is experimentally quantified by using a network to do segmentation and auto-encoder task. This paper proves that network parameters can be reused for two reasons: first, the network features are general; Second, there is little difference between the pre-trained parameters and the ideal network parameters. Through the use of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsConvolution
