Stability properties of Minimal Gated Unit neural networks

Stefano De Carli; Davide Previtali; Mirko Mazzoleni; Fabio Previdi

arXiv:2603.03017·math.OC·March 4, 2026

Stability properties of Minimal Gated Unit neural networks

Stefano De Carli, Davide Previtali, Mirko Mazzoleni, Fabio Previdi

PDF

Open Access

TL;DR

This paper analyzes the stability of the Minimal Gated Unit (MGU) neural network, deriving conditions for stability, proposing training methods, and demonstrating its effectiveness and efficiency on synthetic and real-world data, including the Silverbox benchmark.

Contribution

It introduces parametric stability conditions for MGU networks and develops stability-promoting training methods, enhancing efficiency and reliability in system identification tasks.

Findings

01

MGU networks satisfy input-to-state stability conditions.

02

Proposed training schemes improve stability and convergence.

03

MGU outperforms other RNNs on the Silverbox benchmark.

Abstract

In this work, we address the need for efficient and formally stable Recurrent Neural Networks (RNNs) in environments with limited computational resources by analyzing the stability of the Minimal Gated Unit (MGU) network, a lightweight alternative to common gated RNNs used in system identification. We derive sufficient parametric conditions for the MGU network's input-to-state stability and incremental input-to-state stability properties. These conditions enable a-posteriori validation of model stability and form the basis for novel stability-promoting training methodologies, including a warm-start of the network's parameters and a projected gradient-based optimization scheme, both of which are presented in this work. Comparative evaluation, including robustness analysis and validation on synthetic and real-world data (i.e., the Silverbox benchmark), demonstrates that the minimal gated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Machine Learning and ELM