Deep networks learn to parse uniform-depth context-free languages from local statistics

Jack T. Parley; Francesco Cagnetta; Matthieu Wyart

arXiv:2602.06065·stat.ML·February 10, 2026

Deep networks learn to parse uniform-depth context-free languages from local statistics

Jack T. Parley, Francesco Cagnetta, Matthieu Wyart

PDF

Open Access

TL;DR

This paper investigates how deep neural networks, including convolutional and transformer models, learn to parse uniform-depth context-free languages from local statistical cues, revealing the role of data correlations in hierarchical structure learning.

Contribution

The paper introduces a tunable class of probabilistic context-free grammars and an inference algorithm linking learnability to language statistics, validated across multiple neural architectures.

Findings

01

Correlations at different scales facilitate hierarchical representation learning.

02

Deep networks can parse context-free languages using local statistics.

03

Sample complexity depends on language ambiguity and correlation structure.

Abstract

Understanding how the structure of language can be learned from sentences alone is a central question in both cognitive science and machine learning. Studies of the internal representations of Large Language Models (LLMs) support their ability to parse text when predicting the next word, while representing semantic notions independently of surface form. Yet, which data statistics make these feats possible, and how much data is required, remain largely unknown. Probabilistic context-free grammars (PCFGs) provide a tractable testbed for studying these questions. However, prior work has focused either on the post-hoc characterization of the parsing-like algorithms used by trained networks; or on the learnability of PCFGs with fixed syntax, where parsing is unnecessary. Here, we (i) introduce a tunable class of PCFGs in which both the degree of ambiguity and the correlation structure across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Language and cultural evolution · Neurobiology of Language and Bilingualism