Grammar Induction with Neural Language Models: An Unusual Replication

Phu Mon Htut; Kyunghyun Cho; Samuel R. Bowman

arXiv:1808.10000·cs.CL·August 31, 2018

Grammar Induction with Neural Language Models: An Unusual Replication

Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman

PDF

Open Access 1 Repo

TL;DR

This paper critically examines and replicates recent neural latent tree learning models, confirming their robustness and suggesting neural language models are promising for grammar induction.

Contribution

It provides a fair replication of previous results, extends the analysis to new datasets, and establishes neural language models as a viable approach for grammar induction.

Findings

01

All model variants outperform latent tree learning baselines.

02

Models perform competitively with symbolic grammar induction systems.

03

Neural language modeling shows empirical success for grammar induction.

Abstract

A substantial thread of recent work on latent tree learning has attempted to develop neural network models with parse-valued latent variables and train them on non-parsing tasks, in the hope of having them discover interpretable tree structure. In a recent paper, Shen et al. (2018) introduce such a model and report near-state-of-the-art results on the target task of language modeling, and the first strong latent tree learning result on constituency parsing. In an attempt to reproduce these results, we discover issues that make the original results hard to trust, including tuning and even training on what is effectively the test set. Here, we attempt to reproduce these results in a fair experiment and to extend them to two new datasets. We find that the results of this work are robust: All variants of the model under study outperform all latent tree learning baselines, and perform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nyu-mll/PRPN-Analysis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications