Zero Initialization of modified Gated Recurrent Encoder-Decoder Network for Short Term Load Forecasting
Vedanshu, M M Tripathi

TL;DR
This paper introduces Zero Initialization for single-layer neural networks, demonstrating it accelerates learning and improves accuracy in short-term load forecasting compared to traditional initialization methods.
Contribution
Proposes Zero Initialization for weights in single-layer networks and shows its effectiveness in speeding up training and enhancing accuracy in load forecasting tasks.
Findings
Zero Initialization reduces training epochs.
It improves forecast accuracy to 0.94% error.
Effective in seq2seq models for load prediction.
Abstract
Single layer Feedforward Neural Network(FNN) is used many a time as a last layer in models such as seq2seq or could be a simple RNN network. The importance of such layer is to transform the output to our required dimensions. When it comes to weights and biases initialization, there is no such specific technique that could speed up the learning process. We could depend on deep network initialization techniques such as Xavier or He initialization. But such initialization fails to show much improvement in learning speed or accuracy. In this paper we propose Zero Initialization (ZI) for weights of a single layer network. We first test this technique with on a simple RNN network and compare the results against Xavier, He and Identity initialization. As a final test we implement it on a seq2seq network. It was found that ZI considerably reduces the number of epochs used and improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Image and Signal Denoising Methods · Neural Networks and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
