Sequential training algorithm for neural networks

Jongrae Kim

arXiv:1905.07490·cs.LG·May 21, 2019·1 cites

Sequential training algorithm for neural networks

Jongrae Kim

PDF

Open Access

TL;DR

This paper introduces a sequential training algorithm for large-scale neural networks that trains each layer separately to reduce computational resources and improve convergence, offering a practical alternative to full network training.

Contribution

The paper proposes a novel sequential training method for neural networks that simplifies training and reduces resource requirements with minimal modifications to existing algorithms.

Findings

01

Reduces training time significantly

02

Achieves comparable performance with full training

03

Requires minimal changes to existing algorithms

Abstract

A sequential training method for large-scale feedforward neural networks is presented. Each layer of the neural network is decoupled and trained separately. After the training is completed for each layer, they are combined together. The performance of the network would be sub-optimal compared to the full network training if the optimal solution would be achieved. However, achieving the optimal solution for the full network would be infeasible or require long computing time. The proposed sequential approach reduces the required computer resources significantly and would have better convergences as a single layer is optimised for each optimisation step. The required modifications of existing algorithms to implement the sequential training are minimal. The performance is verified by a simple example.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Face and Expression Recognition