Multi-pretrained Deep Neural Network
Zhen Hu, Zhuyin Xue, Tong Cui, Shiqiang Zong, Chenglong He

TL;DR
This paper compares different pretraining models for deep neural networks, specifically DBN and SDA, analyzing their effects on initial performance and final convergence after fine-tuning.
Contribution
It investigates the differences between DBN and SDA as pretraining models and their impact on model convergence and performance.
Findings
DBN provides a better initial model.
Models pretrained with SDA converge to better final models after fine-tuning.
SDA can improve model convergence when used for second pretraining.
Abstract
Pretraining is widely used in deep neutral network and one of the most famous pretraining models is Deep Belief Network (DBN). The optimization formulas are different during the pretraining process for different pretraining models. In this paper, we pretrained deep neutral network by different pretraining models and hence investigated the difference between DBN and Stacked Denoising Autoencoder (SDA) when used as pretraining model. The experimental results show that DBN get a better initial model. However the model converges to a relatively worse model after the finetuning process. Yet after pretrained by SDA for the second time the model converges to a better model if finetuned.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Image Processing and 3D Reconstruction
MethodsDeep Belief Network
