Grammar as a Foreign Language

Oriol Vinyals; Lukasz Kaiser; Terry Koo; Slav Petrov; Ilya Sutskever,; Geoffrey Hinton

arXiv:1412.7449·cs.CL·June 11, 2015·402 cites

Grammar as a Foreign Language

Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever,, Geoffrey Hinton

PDF

Open Access 5 Repos

TL;DR

This paper introduces a domain-agnostic, attention-enhanced sequence-to-sequence model for syntactic constituency parsing that achieves state-of-the-art accuracy, high data efficiency, and fast processing speeds.

Contribution

It demonstrates that a simple, attention-based seq2seq model can outperform complex parsers and be highly data-efficient for syntactic parsing tasks.

Findings

01

Achieves state-of-the-art results on standard datasets.

02

Matches performance of traditional parsers with limited human annotations.

03

Processes over a hundred sentences per second on CPU.

Abstract

Syntactic constituency parsing is a fundamental problem in natural language processing and has been the subject of intensive research and engineering for decades. As a result, the most accurate parsers are domain specific, complex, and inefficient. In this paper we show that the domain agnostic attention-enhanced sequence-to-sequence model achieves state-of-the-art results on the most widely used syntactic constituency parsing dataset, when trained on a large synthetic corpus that was annotated using existing parsers. It also matches the performance of standard parsers when trained only on a small human-annotated dataset, which shows that this model is highly data-efficient, in contrast to sequence-to-sequence models without the attention mechanism. Our parser is also fast, processing over a hundred sentences per second with an unoptimized CPU implementation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications