The Limitations of Limited Context for Constituency Parsing

Yuchen Li; Andrej Risteski

arXiv:2106.01580·cs.CL·June 4, 2021

The Limitations of Limited Context for Constituency Parsing

Yuchen Li, Andrej Risteski

PDF

Open Access

TL;DR

This paper investigates the representational limitations of neural constituency parsers with limited context, showing they cannot represent certain PCFGs' max-likelihood parses, unlike unlimited context models.

Contribution

It provides a formal analysis of how context limitations affect the ability of neural models to represent PCFGs' parses, highlighting fundamental constraints.

Findings

01

Limited context models cannot represent max-likelihood parses for some PCFGs.

02

Unlimited context models can represent the max-likelihood parse of any PCFG.

03

Context size and directionality critically impact the representational power of neural parsers.

Abstract

Incorporating syntax into neural approaches in NLP has a multitude of practical and scientific benefits. For instance, a language model that is syntax-aware is likely to be able to produce better samples; even a discriminative model like BERT with a syntax module could be used for core NLP tasks like unsupervised syntactic parsing. Rapid progress in recent years was arguably spurred on by the empirical success of the Parsing-Reading-Predict architecture of (Shen et al., 2018a), later simplified by the Order Neuron LSTM of (Shen et al., 2019). Most notably, this is the first time neural approaches were able to successfully perform unsupervised syntactic parsing (evaluated by various metrics like F-1 score). However, even heuristic (much less fully mathematical) understanding of why and when these architectures work is lagging severely behind. In this work, we answer representational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Tanh Activation · Linear Warmup With Linear Decay · Layer Normalization · Residual Connection · WordPiece