Path classification by stochastic linear recurrent neural networks

Wiebke Bartolomaeus; Youness Boutaib; Sandra Nestler; Holger Rauhut

arXiv:2108.03090·stat.ML·September 13, 2023

Path classification by stochastic linear recurrent neural networks

Wiebke Bartolomaeus, Youness Boutaib, Sandra Nestler, Holger Rauhut

PDF

Open Access

TL;DR

This paper analyzes stochastic linear recurrent neural networks for path classification, providing theoretical generalization bounds, demonstrating their robustness and trainability, and exploring the trade-off between accuracy and robustness.

Contribution

It introduces a theoretical framework for understanding stochastic RNNs in classification, including generalization bounds and robustness properties, supported by numerical experiments.

Findings

01

RNNs retain path signatures as the key information for classification

02

Theoretical generalization error bounds are established for stochastic RNNs

03

Numerical experiments confirm robustness and reveal a trade-off between accuracy and robustness

Abstract

We investigate the functioning of a classifying biological neural network from the perspective of statistical learning theory, modelled, in a simplified setting, as a continuous-time stochastic recurrent neural network (RNN) with identity activation function. In the purely stochastic (robust) regime, we give a generalisation error bound that holds with high probability, thus showing that the empirical risk minimiser is the best-in-class hypothesis. We show that RNNs retain a partial signature of the paths they are fed as the unique information exploited for training and classification tasks. We argue that these RNNs are easy to train and robust and back these observations with numerical experiments on both synthetic and real data. We also exhibit a trade-off phenomenon between accuracy and robustness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Advanced Memory and Neural Computing