Progressive Learning for Systematic Design of Large Neural Networks

Saikat Chatterjee; Alireza M. Javid; Mostafa Sadeghi; Partha P. Mitra,; Mikael Skoglund

arXiv:1710.08177·cs.NE·October 24, 2017·24 cites

Progressive Learning for Systematic Design of Large Neural Networks

Saikat Chatterjee, Alireza M. Javid, Mostafa Sadeghi, Partha P. Mitra,, Mikael Skoglund

PDF

Open Access 1 Repo

TL;DR

This paper introduces a progressive, systematic approach for designing large neural networks by incrementally increasing network size and optimizing each layer with convex methods, reducing manual tuning and improving generalization.

Contribution

It proposes a novel progressive design algorithm leveraging the property of certain nonlinear functions, simplifying network construction and parameter regularization.

Findings

01

Networks designed with the method show good generalization.

02

Regularization reduces manual tuning effort.

03

Random weights help decrease learnable parameters.

Abstract

We develop an algorithm for systematic design of a large artificial neural network using a progression property. We find that some non-linear functions, such as the rectifier linear unit and its derivatives, hold the property. The systematic design addresses the choice of network size and regularization of parameters. The number of nodes and layers in network increases in progression with the objective of consistently reducing an appropriate cost. Each layer is optimized at a time, where appropriate parameters are learned using convex optimization. Regularization parameters for convex optimization do not need a significant manual effort for tuning. We also use random instances for some weight matrices, and that helps to reduce the number of parameters we learn. The developed network is expected to show good generalization power due to appropriate regularization and use of random weights…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

viebboy/HeMLGOP
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Neural Networks and Applications · Stochastic Gradient Optimization Techniques