A Matrix Ensemble Kalman Filter-based Multi-arm Neural Network to Adequately Approximate Deep Neural Networks
Ved Piyush, Yuchen Yan, Yuzhen Zhou, Yanbin Yin, Souparno Ghosh

TL;DR
This paper introduces MEnKF-ANN, a matrix ensemble Kalman filter-based multi-arm neural network that effectively approximates deep neural networks, including LSTMs, especially with small sample sizes, and provides uncertainty estimates.
Contribution
It presents a novel multi-arm Kalman filter approach for neural network approximation that handles unequal feature set sizes and offers uncertainty quantification.
Findings
Successfully approximates LSTM networks for microbiome classification
Performs well with small training samples
Provides reliable uncertainty estimates in predictions
Abstract
Deep Learners (DLs) are the state-of-art predictive mechanism with applications in many fields requiring complex high dimensional data processing. Although conventional DLs get trained via gradient descent with back-propagation, Kalman Filter (KF)-based techniques that do not need gradient computation have been developed to approximate DLs. We propose a multi-arm extension of a KF-based DL approximator that can mimic DL when the sample size is too small to train a multi-arm DL. The proposed Matrix Ensemble Kalman Filter-based multi-arm ANN (MEnKF-ANN) also performs explicit model stacking that becomes relevant when the training sample has an unequal-size feature set. Our proposed technique can approximate Long Short-term Memory (LSTM) Networks and attach uncertainty to the predictions obtained from these LSTMs with desirable coverage. We demonstrate how MEnKF-ANN can "adequately"…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gene expression and cancer classification · Image Retrieval and Classification Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
