Multi-pretrained Deep Neural Network

Zhen Hu; Zhuyin Xue; Tong Cui; Shiqiang Zong; Chenglong He

arXiv:1606.00540·cs.NE·June 3, 2016

Multi-pretrained Deep Neural Network

Zhen Hu, Zhuyin Xue, Tong Cui, Shiqiang Zong, Chenglong He

PDF

Open Access

TL;DR

This paper compares different pretraining models for deep neural networks, specifically DBN and SDA, analyzing their effects on initial performance and final convergence after fine-tuning.

Contribution

It investigates the differences between DBN and SDA as pretraining models and their impact on model convergence and performance.

Findings

01

DBN provides a better initial model.

02

Models pretrained with SDA converge to better final models after fine-tuning.

03

SDA can improve model convergence when used for second pretraining.

Abstract

Pretraining is widely used in deep neutral network and one of the most famous pretraining models is Deep Belief Network (DBN). The optimization formulas are different during the pretraining process for different pretraining models. In this paper, we pretrained deep neutral network by different pretraining models and hence investigated the difference between DBN and Stacked Denoising Autoencoder (SDA) when used as pretraining model. The experimental results show that DBN get a better initial model. However the model converges to a relatively worse model after the finetuning process. Yet after pretrained by SDA for the second time the model converges to a better model if finetuned.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Image Processing and 3D Reconstruction

MethodsDeep Belief Network