Noisy Recurrent Neural Networks

Soon Hoe Lim; N. Benjamin Erichson; Liam Hodgkinson; Michael W.; Mahoney

arXiv:2102.04877·stat.ML·December 2, 2021·6 cites

Noisy Recurrent Neural Networks

Soon Hoe Lim, N. Benjamin Erichson, Liam Hodgkinson, Michael W., Mahoney

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a framework for analyzing noisy RNNs as discretizations of stochastic differential equations, revealing how noise injection acts as an implicit regularizer that enhances stability and robustness.

Contribution

It provides a theoretical analysis of noise injection in RNNs, showing its regularization effects and stability benefits, supported by empirical validation.

Findings

01

Noise promotes flatter minima and larger classification margins.

02

Noise injection improves model stability during training.

03

Empirical results show enhanced robustness to input perturbations.

Abstract

We provide a general framework for studying recurrent neural networks (RNNs) trained by injecting noise into hidden states. Specifically, we consider RNNs that can be viewed as discretizations of stochastic differential equations driven by input data. This framework allows us to study the implicit regularization effect of general noise injection schemes by deriving an approximate explicit regularizer in the small noise regime. We find that, under reasonable assumptions, this implicit regularization promotes flatter minima; it biases towards models with more stable dynamics; and, in classification tasks, it favors models with larger classification margin. Sufficient conditions for global stability are obtained, highlighting the phenomenon of stochastic stabilization, where noise injection can improve stability during training. Our theory is supported by empirical results which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erichson/NoisyRNN
pytorchOfficial

Videos

Noisy Recurrent Neural Networks· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning · Neural Networks and Applications