The Limitations of Limited Context for Constituency Parsing
Yuchen Li, Andrej Risteski

TL;DR
This paper investigates the representational limitations of neural constituency parsers with limited context, showing they cannot represent certain PCFGs' max-likelihood parses, unlike unlimited context models.
Contribution
It provides a formal analysis of how context limitations affect the ability of neural models to represent PCFGs' parses, highlighting fundamental constraints.
Findings
Limited context models cannot represent max-likelihood parses for some PCFGs.
Unlimited context models can represent the max-likelihood parse of any PCFG.
Context size and directionality critically impact the representational power of neural parsers.
Abstract
Incorporating syntax into neural approaches in NLP has a multitude of practical and scientific benefits. For instance, a language model that is syntax-aware is likely to be able to produce better samples; even a discriminative model like BERT with a syntax module could be used for core NLP tasks like unsupervised syntactic parsing. Rapid progress in recent years was arguably spurred on by the empirical success of the Parsing-Reading-Predict architecture of (Shen et al., 2018a), later simplified by the Order Neuron LSTM of (Shen et al., 2019). Most notably, this is the first time neural approaches were able to successfully perform unsupervised syntactic parsing (evaluated by various metrics like F-1 score). However, even heuristic (much less fully mathematical) understanding of why and when these architectures work is lagging severely behind. In this work, we answer representational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Tanh Activation · Linear Warmup With Linear Decay · Layer Normalization · Residual Connection · WordPiece
