Precision, Stability, and Generalization: A Comprehensive Assessment of RNNs learnability capability for Classifying Counter and Dyck Languages
Neisarg Dave, Daniel Kifer, Lee Giles, Ankur Mali

TL;DR
This paper critically assesses RNNs' ability to learn structured formal languages, revealing that their performance heavily depends on data sampling, embedding precision, and initialization, challenging traditional beliefs about their expressiveness.
Contribution
The study demonstrates that RNNs' learnability is limited by data structure and sampling strategies, emphasizing the importance of constraints over mere expressivity for language classification.
Findings
RNN performance declines with increased similarity between positive and negative examples.
O2RNNs generally exhibit greater stability than LSTM models.
Basic RNN embeddings outperform chance in classifying Dyck and counter languages.
Abstract
This study investigates the learnability of Recurrent Neural Networks (RNNs) in classifying structured formal languages, focusing on counter and Dyck languages. Traditionally, both first-order (LSTM) and second-order (O2RNN) RNNs have been considered effective for such tasks, primarily based on their theoretical expressiveness within the Chomsky hierarchy. However, our research challenges this notion by demonstrating that RNNs primarily operate as state machines, where their linguistic capabilities are heavily influenced by the precision of their embeddings and the strategies used for sampling negative examples. Our experiments revealed that performance declines significantly as the structural similarity between positive and negative examples increases. Remarkably, even a basic single-layer classifier using RNN embeddings performed better than chance. To evaluate generalization, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Ferroelectric and Negative Capacitance Devices · Advanced Malware Detection Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
