Stability properties of Minimal Gated Unit neural networks
Stefano De Carli, Davide Previtali, Mirko Mazzoleni, Fabio Previdi

TL;DR
This paper analyzes the stability of the Minimal Gated Unit (MGU) neural network, deriving conditions for stability, proposing training methods, and demonstrating its effectiveness and efficiency on synthetic and real-world data, including the Silverbox benchmark.
Contribution
It introduces parametric stability conditions for MGU networks and develops stability-promoting training methods, enhancing efficiency and reliability in system identification tasks.
Findings
MGU networks satisfy input-to-state stability conditions.
Proposed training schemes improve stability and convergence.
MGU outperforms other RNNs on the Silverbox benchmark.
Abstract
In this work, we address the need for efficient and formally stable Recurrent Neural Networks (RNNs) in environments with limited computational resources by analyzing the stability of the Minimal Gated Unit (MGU) network, a lightweight alternative to common gated RNNs used in system identification. We derive sufficient parametric conditions for the MGU network's input-to-state stability and incremental input-to-state stability properties. These conditions enable a-posteriori validation of model stability and form the basis for novel stability-promoting training methodologies, including a warm-start of the network's parameters and a projected gradient-based optimization scheme, both of which are presented in this work. Comparative evaluation, including robustness analysis and validation on synthetic and real-world data (i.e., the Silverbox benchmark), demonstrates that the minimal gated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Machine Learning and ELM
