Probabilistic Deterministic Finite Automata and Recurrent Networks, Revisited
S. E. Marzen, J. P. Crutchfield

TL;DR
This paper evaluates the predictive capabilities of various neural models on probabilistic finite automata, revealing significant gaps in accuracy and emphasizing the importance of causal state inference methods.
Contribution
It systematically compares RNNs, RCs, and linear models on PDFAs, highlighting their limitations and the need for causal state inference in predictive modeling.
Findings
LSTMs outperform RCs and linear models
Models can be up to 50% less accurate than optimal predictors
Optimized models still fall short of maximal accuracy by ~5%
Abstract
Reservoir computers (RCs) and recurrent neural networks (RNNs) can mimic any finite-state automaton in theory, and some workers demonstrated that this can hold in practice. We test the capability of generalized linear models, RCs, and Long Short-Term Memory (LSTM) RNN architectures to predict the stochastic processes generated by a large suite of probabilistic deterministic finite-state automata (PDFA). PDFAs provide an excellent performance benchmark in that they can be systematically enumerated, the randomness and correlation structure of their generated processes are exactly known, and their optimal memory-limited predictors are easily computed. Unsurprisingly, LSTMs outperform RCs, which outperform generalized linear models. Surprisingly, each of these methods can fall short of the maximal predictive accuracy by as much as 50% after training and, when optimized, tend to fall short…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Machine Learning and ELM · Neural Networks and Reservoir Computing
MethodsTest
