Simple2Complex: Global Optimization by Gradient Descent
Ming Li

TL;DR
The paper introduces Simple2Complex, a progressive training method for deep neural networks that incrementally adds layers to improve global optimization and reduce local minima trapping, demonstrated on CIFAR-10.
Contribution
It proposes a novel layer-wise growth training approach for deep networks, enhancing global optimization over traditional end-to-end methods.
Findings
Outperforms end-to-end training on CIFAR-10
Reduces likelihood of local minima trapping
Demonstrates effective layer-wise growth training
Abstract
A method named simple2complex for modeling and training deep neural networks is proposed. Simple2complex train deep neural networks by smoothly adding more and more layers to the shallow networks, as the learning procedure going on, the network is just like growing. Compared with learning by end2end, simple2complex is with less possibility trapping into local minimal, namely, owning ability for global optimization. Cifar10 is used for verifying the superiority of simple2complex.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
