Unsupervised Recurrent Neural Network Grammars

Yoon Kim; Alexander M. Rush; Lei Yu; Adhiguna Kuncoro; Chris Dyer,; G\'abor Melis

arXiv:1904.03746·cs.CL·August 6, 2019·5 cites

Unsupervised Recurrent Neural Network Grammars

Yoon Kim, Alexander M. Rush, Lei Yu, Adhiguna Kuncoro, Chris Dyer,, G\'abor Melis

PDF

Open Access 1 Repo

TL;DR

This paper explores unsupervised learning of recurrent neural network grammars using variational inference, demonstrating competitive performance in language modeling and grammar induction without requiring annotated parse trees.

Contribution

It introduces an unsupervised training method for RNNGs using an inference network as a neural CRF parser, enabling grammar induction without labeled data.

Findings

01

Unsupervised RNNGs match supervised models in language modeling tasks.

02

They perform competitively in constituency grammar induction.

03

The approach works well for English and Chinese datasets.

Abstract

Recurrent neural network grammars (RNNG) are generative models of language which jointly model syntax and surface structure by incrementally generating a syntax tree and sentence in a top-down, left-to-right order. Supervised RNNGs achieve strong language modeling and parsing performance, but require an annotated corpus of parse trees. In this work, we experiment with unsupervised learning of RNNGs. Since directly marginalizing over the space of latent trees is intractable, we instead apply amortized variational inference. To maximize the evidence lower bound, we develop an inference network parameterized as a neural CRF constituency parser. On language modeling, unsupervised RNNGs perform as well their supervised counterparts on benchmarks in English and Chinese. On constituency grammar induction, they are competitive with recent neural language models that induce tree structures from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

harvardnlp/urnng
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsConditional Random Field