Simple2Complex: Global Optimization by Gradient Descent

Ming Li

arXiv:1605.00404·cs.LG·May 3, 2016

Simple2Complex: Global Optimization by Gradient Descent

Ming Li

PDF

Open Access

TL;DR

The paper introduces Simple2Complex, a progressive training method for deep neural networks that incrementally adds layers to improve global optimization and reduce local minima trapping, demonstrated on CIFAR-10.

Contribution

It proposes a novel layer-wise growth training approach for deep networks, enhancing global optimization over traditional end-to-end methods.

Findings

01

Outperforms end-to-end training on CIFAR-10

02

Reduces likelihood of local minima trapping

03

Demonstrates effective layer-wise growth training

Abstract

A method named simple2complex for modeling and training deep neural networks is proposed. Simple2complex train deep neural networks by smoothly adding more and more layers to the shallow networks, as the learning procedure going on, the network is just like growing. Compared with learning by end2end, simple2complex is with less possibility trapping into local minimal, namely, owning ability for global optimization. Cifar10 is used for verifying the superiority of simple2complex.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Neural Networks and Applications