Large Language Models as Proxies for Theories of Human Linguistic Cognition
Imry Ziv, Nur Lan, Emmanuel Chemla, Roni Katzir

TL;DR
This paper explores the potential of large language models to serve as proxies for theories of human linguistic cognition, highlighting their current limitations and possible future applications in understanding language acquisition.
Contribution
It introduces the idea of using LLMs as proxies for cognitive theories and discusses their potential and current limitations in modeling language acquisition processes.
Findings
LLMs can help test linguistic theories against corpus data
Current LLMs have limited capability to fully serve as cognitive proxies
Potential for LLMs to compare ease of acquiring different linguistic patterns
Abstract
We consider the possible role of current large language models (LLMs) in the study of human linguistic cognition. We focus on the use of such models as proxies for theories of cognition that are relatively linguistically-neutral in their representations and learning but differ from current LLMs in key ways. We illustrate this potential use of LLMs as proxies for theories of cognition in the context of two kinds of questions: (a) whether the target theory accounts for the acquisition of a given pattern from a given corpus; and (b) whether the target theory makes a given typologically-attested pattern easier to acquire than another, typologically-unattested pattern. For each of the two questions we show, building on recent literature, how current LLMs can potentially be of help, but we note that at present this help is quite limited.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsFocus
