ListOps: A Diagnostic Dataset for Latent Tree Learning

Nikita Nangia; Samuel R. Bowman

arXiv:1804.06028·cs.CL·April 18, 2018

ListOps: A Diagnostic Dataset for Latent Tree Learning

Nikita Nangia, Samuel R. Bowman

PDF

3 Repos

TL;DR

This paper introduces ListOps, a toy dataset designed to evaluate the parsing capabilities of latent tree learning models, revealing their inability to learn correct parses even on simplified tasks.

Contribution

The paper presents ListOps as a new diagnostic dataset that isolates parsing ability, demonstrating that current latent tree models fail to learn proper syntax even in simplified settings.

Findings

01

Latent tree models perform worse than sequential RNNs on ListOps.

02

Current models cannot learn the correct parsing strategy for ListOps.

03

ListOps provides a controlled environment for studying parsing in neural models.

Abstract

Latent tree learning models learn to parse a sentence without syntactic supervision, and use that parse to build the sentence representation. Existing work on such models has shown that, while they perform well on tasks like sentence classification, they do not learn grammars that conform to any plausible semantic or syntactic formalism (Williams et al., 2018a). Studying the parsing ability of such models in natural language can be challenging due to the inherent complexities of natural language, like having several valid parses for a single sentence. In this paper we introduce ListOps, a toy dataset created to study the parsing ability of latent tree models. ListOps sequences are in the style of prefix arithmetic. The dataset is designed to have a single correct parsing strategy that a system needs to learn to succeed at the task. We show that the current leading latent tree models are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.