Word class representations spontaneously emerge in a deep neural network trained on next word prediction
Kishore Surendra, Achim Schilling, Paul Stoewer, Andreas Maier, and Patrick Krauss

TL;DR
This study shows that deep neural networks trained on next-word prediction spontaneously develop internal representations of word classes, supporting usage-based theories of language acquisition.
Contribution
The paper demonstrates that word class representations can emerge in a neural network trained only on next-word prediction, without explicit instruction about word classes, which offers insight into language learning mechanisms.
Findings
Neural network representations cluster by word class
Word classes emerge without explicit training
Supports usage-based language acquisition theories
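To make "representations cluster by word class" concrete, the sketch below computes one simple separation score: the ratio of mean between-class distance to mean within-class distance over a set of labeled vectors. This is an illustrative assumption, not the metric used in the paper; the function names, the toy 2-D "activations", and the labels are all hypothetical.

```python
import math
from itertools import combinations


def mean_distance(pairs):
    """Average Euclidean distance over a list of (vector, vector) pairs."""
    dists = [math.dist(a, b) for a, b in pairs]
    return sum(dists) / len(dists)


def class_separation(vectors, labels):
    """Ratio of mean between-class to mean within-class distance.

    Values well above 1 suggest that vectors sharing a label sit closer
    together than vectors with different labels. Hypothetical helper,
    not the paper's own measure.
    """
    within, between = [], []
    for i, j in combinations(range(len(vectors)), 2):
        pair = (vectors[i], vectors[j])
        if labels[i] == labels[j]:
            within.append(pair)
        else:
            between.append(pair)
    return mean_distance(between) / mean_distance(within)


# Hypothetical toy "hidden-layer activations", one per word,
# with made-up word class labels.
toy_vectors = [(0.9, 0.1), (1.0, 0.2), (0.1, 0.9), (0.2, 1.0)]
toy_labels = ["noun", "noun", "verb", "verb"]

separation = class_separation(toy_vectors, toy_labels)
# Same vectors with labels that ignore the geometry, for comparison.
mismatched = class_separation(toy_vectors, ["noun", "verb", "noun", "verb"])
```

With real data, the vectors would be activations read out from a layer of the trained network and the labels would come from a part-of-speech annotation; a score that rises well above 1 would be consistent with clustering by word class, while a score near or below 1 would not.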
Abstract
How do humans learn language, and can the first language be learned at all? These fundamental questions are still hotly debated. In contemporary linguistics, there are two major schools of thought that give completely opposite answers. According to Chomsky's theory of universal grammar, language cannot be learned because children are not exposed to sufficient data in their linguistic environment. In contrast, usage-based models of language assume a profound relationship between language structure and language use. In particular, contextual mental processing and mental representations are assumed to have the cognitive capacity to capture the complexity of actual language use at all levels. The prime example is syntax, i.e., the rules by which words are assembled into larger units such as sentences. Typically, syntactic rules are expressed as sequences of word classes. However, it remains…
Taxonomy
Topics: Neurobiology of Language and Bilingualism · Language Development and Disorders · Language and cultural evolution
