Zero Initialization of modified Gated Recurrent Encoder-Decoder Network   for Short Term Load Forecasting

Vedanshu; M M Tripathi

arXiv:1812.03425·cs.LG·December 11, 2018·1 cites

Zero Initialization of modified Gated Recurrent Encoder-Decoder Network for Short Term Load Forecasting

Vedanshu, M M Tripathi

PDF

Open Access

TL;DR

This paper introduces Zero Initialization for single-layer neural networks, demonstrating it accelerates learning and improves accuracy in short-term load forecasting compared to traditional initialization methods.

Contribution

Proposes Zero Initialization for weights in single-layer networks and shows its effectiveness in speeding up training and enhancing accuracy in load forecasting tasks.

Findings

01

Zero Initialization reduces training epochs.

02

It improves forecast accuracy to 0.94% error.

03

Effective in seq2seq models for load prediction.

Abstract

Single layer Feedforward Neural Network(FNN) is used many a time as a last layer in models such as seq2seq or could be a simple RNN network. The importance of such layer is to transform the output to our required dimensions. When it comes to weights and biases initialization, there is no such specific technique that could speed up the learning process. We could depend on deep network initialization techniques such as Xavier or He initialization. But such initialization fails to show much improvement in learning speed or accuracy. In this paper we propose Zero Initialization (ZI) for weights of a single layer network. We first test this technique with on a simple RNN network and compare the results against Xavier, He and Identity initialization. As a final test we implement it on a seq2seq network. It was found that ZI considerably reduces the number of epochs used and improve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEnergy Load and Power Forecasting · Image and Signal Denoising Methods · Neural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence