A Novel Neural Network Training Framework with Data Assimilation
Chong Chen, Qinghui Xing, Xin Ding, Yaru Xue, Tianfu Zhong

TL;DR
This paper introduces a gradient-free neural network training framework using data assimilation techniques, enabling online and offline learning without relying on gradient calculations, and demonstrates its effectiveness through synthetic function regression tasks.
Contribution
The study proposes a novel gradient-free training framework for neural networks based on data assimilation algorithms like EnKF and ESMDA, offering an alternative to traditional gradient descent methods.
Findings
The proposed framework outperforms gradient descent in synthetic regression tasks.
It enables online and offline training without gradient dependence.
The method shows promise for training various neural network architectures.
Abstract
In recent years, the prosperity of deep learning has revolutionized the Artificial Neural Networks. However, the dependence of gradients and the offline training mechanism in the learning algorithms prevents the ANN for further improvement. In this study, a gradient-free training framework based on data assimilation is proposed to avoid the calculation of gradients. In data assimilation algorithms, the error covariance between the forecasts and observations is used to optimize the parameters. Feedforward Neural Networks (FNNs) are trained by gradient decent, data assimilation algorithms (Ensemble Kalman Filter (EnKF) and Ensemble Smoother with Multiple Data Assimilation (ESMDA)), respectively. ESMDA trains FNN with pre-defined iterations by updating the parameters using all the available observations which can be regard as offline learning. EnKF optimize FNN when new observation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeteorological Phenomena and Simulations · Hydrological Forecasting Using AI · Energy Load and Power Forecasting
