What's Going On in Neural Constituency Parsers? An Analysis
David Gaddy, Mitchell Stern, and Dan Klein

TL;DR
This paper investigates how modern neural constituency parsers implicitly learn information traditionally provided by explicit grammatical structures, showing neural models can encode similar linguistic knowledge.
Contribution
The study introduces a high-performance neural parser and analyzes its internal representations to demonstrate it captures information once supplied by explicit grammatical components.
Findings
Neural models encode information similar to grammars and lexicons.
Implicit learning reduces the need for explicit structural components.
The proposed model achieves 92.08 F1 on PTB.
Abstract
A number of differences have emerged between modern and classic approaches to constituency parsing in recent years, with structural components like grammars and feature-rich lexicons becoming less central while recurrent neural network representations rise in popularity. The goal of this work is to analyze the extent to which information provided directly by the model structure in classical systems is still being captured by neural methods. To this end, we propose a high-performance neural model (92.08 F1 on PTB) that is representative of recent work and perform a series of investigative experiments. We find that our model implicitly learns to encode much of the same information that was explicitly provided by grammars and lexicons in the past, indicating that this scaffolding can largely be subsumed by powerful general-purpose neural machinery.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
